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Abstract 

This  thesis  investigates  the  feasibility  of  optically  implementing  a  real-time, 
pattern  recognition  system  using  correlation  techniques  in  a  position,  scale,  and  ro¬ 
tation  invariant  (PSRI)  feature  space.  Input  target  templates  were  optically  Fourier 
transformed  using  an  improved,  high-resolution,  high-pass  filter  positioned  in  the 
focal  plane  of  the  transforming  lens.  A  logarithmic-polar  coordinate  transform  of 
the  magnitude-squared  of  the  Fourier  transform  (|FT|2)  was  performed  with  an  im¬ 
proved  method  of  focusing  the  magnified  |FT|2  onto  a  computer  generated  hologram 
(CGH),  which  was  shown  to  scale  linearly  along  the  horizontal  axis  and  logarithmi¬ 
cally  along  the  vertical  axis.  Optical,  matched-filter  correlations  on  the  magnitude- 
squared  Fourier  transform  logarithmic-polar  (FLRT)  feature  space  were  performed 
using  thermoplastic,  phase-relief  holography  and  Vander  Lugt  filtering. 

Correlation  results  prove  that  scale  and  rotation  changes  of  the  input  can  he 
predicted  accurately  based  on  linear  shifts  of  the  correlation  peak.  Also,  the  FLRT 
feature  space  is  shown  to  provide  excellent  discrimination  for  multiple-input,  scenes. 
The  need  for  a  cyclic  correlation  is  verified,  and  digital  simulations  prove  the  validity 
of  the  optical  results. 


POSITION,  SCALE,  AND  ROTATION  INVARIANT 
OPTICAL  PATTERN  RECOGNITION 

FOR 

TARGET  EXTRACTION  AND  IDENTIFICATION 

I.  Introduction 

For  many  years  man  has  sought  an  algorithm  that  would  enable  him  to  de¬ 
velop  a  machine  that  can  detect,  locate,  and  identify  targets.  Obviously,  there  are 
numerous  applications  for  such  a  machine.  For  example,  machine  vision  could  allow 
a  robotic  system  to  maneuver  in  a  dynamic  environment  while  performing  aircraft 
maintenance  on  a  flight  line.  Machine  vision  could  also  be  used  in  designing  au¬ 
tonomous  weapons  that  seek  out  specific  targets.  The  military  has  a  need  to  develop 
just  such  autonomous  systems  to  detect  and  classify  potential  targets  for  use  in  mis¬ 
sile  guidance  and  strategic  surveillance  [9:1].  However,  autonomous  pattern  recogni¬ 
tion  systems  must  be  able  to  accurately  detect  and  classify  targets  in  real-time  and 
in  the  presence  of  scene  irregularities  like  background  noise,  illumination  variations, 
and  variations  in  target  size,  position,  and  in-plane  rotation.  While  much  research 
has  gone  on  in  both  digital  and  optical  techniques  for  pattern  recognition,  a  truly 
autonomous,  real-time  system  which  accurately  classifies  targets  embedded  in  their 
natural  surroundings  has  not  been  developed. 

/./  Background 

Pattern  recognition,  under  all  but  the  most  restrictive  conditions,  is  a  very  un¬ 
structured  field.  Potential  targets  in  a  scene  may  appear  in  various  sizes,  rotations, 
or  even  be  partially  occluded  by  other  objects  within  the  scene.  With  the  over¬ 
whelming  number  of  possible  variations,  using  digital  techniques  alone,  even  with 
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the  most  sophisticated  algorithms,  becomes  unmanageable  when  only  a  few  variables 
are  allowed  to  change.  Since  optical  systems  process  entire  scenes  m  parallel  and 
do  so  in  the  time  it  takes  for  the  light  to  travel  through  the  system,  optical  pattern 
recognition  has  become  increasingly  more  popular  and  promising  [9]. 

Correlation  between  an  input  image  (scene)  and  a  template  is  the  basic  ap¬ 
proach  used  in  most  optical  (as  well  as  digital)  pattern  recognition  algorithms  [9:2]. 
However,  correlation,  when  implemented  using  Fourier  transforms,  is  not  a  scale 
invariant  transformation.  Casasent  and  Psaltis  have  shown  that  coherent,  optical 
correlations  which  are  both  scale  and  shift  invariant  can  be  accomplished  using  a 
combined  Fourier-MeHin  transform  [3,  4].  With  this  Fourier-Mellin  technique,  in 
conjunction  with  Vander  Lugt  filtering  [6:171-184],  Casasent  and  Psaltis  have  devel¬ 
oped  a  position,  scale,  and  rotation  invariant  (PSRI)  transformation  for  objects  in 
pristine  backgrounds  [3:78]. 

In  1980,  Horev,  during  his  thesis  work  at  the  Air  Force  Institute  of  Technol¬ 
ogy  (AFIT),  developed  an  algorithm  which  could  be  used  for  locating  objects  in 
two-dimensional  intensity  images  without  prior  knowledge  of  position,  scale,  or  ro¬ 
tation  of  the  object  [7],  Horev’s  algorithm  used  an  approach  similar  to  Casasent  and 
Psaltis  to  demonstrate  recognition  with  translation,  scale,  and  rotation  invariance  in 
high  clutter  scenes.  Kobel  and  Martin,  in  1986,  digitally  implemented  Horev's  algo¬ 
rithm  on  the  VAX  11/780  system  at  AFIT.  Using  modified  phase  correlation  and  a 
Blackman  smoothing  function,  they  also  improved  Horev’s  algorithm  to  enhance  its 
ability  to  determine  correlation  peaks  needed  for  target  classification  and  extraction 
[9].  The  algorithm  is  now  known  as  the  Kobel-Martin-Horev  algorithm. 

In  1987,  Mayo  attempted  an  optical  implementation  of  portions  of  the  Kobel- 
Martin-Horev  algorithm  using  a  computer  generated  interferogram  (also  known  as  a 
computer  generated  hologram  (CGII))  and  a  magneto-optic  spatial  light  modulator 
(SLM)  [12].  Detailed  descriptions  of  the  CGII  and  the  SLM  are  given  in  Appendices 
B  and  C’,  respectively.  He  successfully  implemented  a  PSRI  feature  space*  but  did 
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not  complete  optical  implementation  of  the  matched-filter  correlations.  Also  in  19S7, 
Miazza  improved  and  extended  the  Kobel-Martin-Horev  algorithm  and  developed  a 
model  for  the  optical  implementation  of  the  improved  algorithm.  The  essence  of 
Miazza’s  work  involves  phase  extraction  and  reinsertion  using  optical  methods  [14]. 

Troxel  used  Kobel  and  Martin’s  improved  version  of  Horev’s  algorithm  to  im¬ 
plement  target  recognition  using  range  data  and  neural  networks  [18].  He  used 
segmented  images  of  tanks,  trucks,  and  jeeps  in  a  digital  implementation  of  Kobel 
and  Martin’s  algorithm  to  prepare  autocorrelation  and  cross-correlation  peaks  of  the 
PSRI  space.  The  segmented  images,  both  hand  segmented  and  computer  segmented, 
were  binary  silhouette  templates  created  from  the  actual  range  images  [l8:Sec  II. l]. 
Using  a  multilayer  perceptron  implementation  of  a  neural  network,  Troxel  attempted 
to  classify  random  input  scenes  of  trucks,  tanks,  and  jeeps  by  training  the  neural 
network  with  samples  of  autocorrelations  and  crosscorrelation  of  the  objects  [18:Sec 
VI, 6],  The  neural  network  was  also  a  digital  implementation  and  not  an  actual  hard¬ 
ware  network.  Although  Troxel’s  work  was  successful  in  proving  the  validity  of  the 
algorithm,  the  classification  of  targets  was  still  done  digitally  and  its  implementation 
was  far  from  real-time. 

1.2  Problem  Definition 

The  thesis  problem  is  to  investigate  the  hybrid  pattern  recognition  work  of 
Kobel  and  Martin  and  of  Mayo  and  Miazza  in  an  effort  to  determine  the  feasibility 
of  a  real-time,  optical  pattern  recognition  system.  The  system  would  be  invariant 
to  the  position,  scale,  and  rotation  changes  of  objects  in  realistic  input  scenes  and 
could  be  used  to  locate  and  classify  targets  in  real-time. 

1.3  Scope  of  Thesis 

This  thesis  focuses  on  investigating  the  optical  implementation  of  correlation 
techniques  used  in  extracting  and  identifying  targets  from  raw  input  scenes.  This  was 
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accomplished  by  first  readdressing  and  improving  upon  the  work  of  Mayo  through 
improved  optical  techniques,  and  then  by  demonstrating  and  evaluating  portions  of 
the  Miazza  design.  In  particular,  Vander  Lugt  filter  correlations  of  targets  in  the 
magnitude-squared  Fourier  transform  In  r  —  6  feature  space  were  constructed  and 
evaluated. 

1.4  Approach 

To  prepare  for  the  testing  of  the  individual  steps  in  the  Miazza  design,  a  set 
of  target  scenes  and  templates  were  generated.  The  scenes  and  templates  ranged 
in  complexity  from  simple  geometric  figures  to  actual  photographs  of  tanks  and 
trucks.  In  template  preparation,  these  images  were  transferred  to  high-resolution, 
glass  plates  for  optical  implementation.  They  were  also  stored  in  digitized  files  on 
the  VAX  11/780  computer  in  the  Air  Force  Institute  of  Technology's  engineering 
building.  The  software  package  “Executive”  developed  by  Kobel  and  Martin  [9:Vol 
II]  to  digitally  implement  the  Kobel-Martin-Horev  algorithm  verified  the  accuracy 
of  the  optical  data  obtained  during  this  research. 

Correlation  of  the  feature  space  is  demonstrated  in  three  phases.  First,  the 
target  scenes  and  templates  are  Fourier  transformed  and  then  In  r  —  0  coordinate 
transformed.  Next,  a  Vander  Lugt  filter  of  the  templates  is  made  with  a  thermo¬ 
plastic  holographic  camera.  Finally,  the  Vander  Lugt  filter  is  used  to  produce  corre¬ 
lations  of  the  transformed  target  scenes  with  the  transformed  target  templates.  The 
correlation  peaks  are  analyzed  for  accuracy  of  indicating  target  scale  and  rotation 
angle. 

1.5  Overview  of  Thesis 

This  thesis  is  structured  to  present  the  material  in  the  natural  order  that  the 
investigation  progressed.  The  present  chapter  introduced  the  basic  problem  to  be 
addressed  and  then  quickly  summarized  some  of  the  background  work  which  supports 
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this  thesis  effort.  In  addition,  this  chapter  explained  the  scope  of  the  thesis,  as  well 
as,  the  approach  taken  to  solve  the  problem. 


Chapter  II  discusses  the  theoretical  background  of  many  of  the  basic  opti¬ 
cal  concepts  investigated  during  this  thesis  effort  and  previously  mentioned  in  the 
Background  section  of  this  chapter.  These  concepts  include:  correlations,  Fourier 
transforms,  coordinate  transforms,  CGHs,  and  matched-filtering.  Additionally,  the 
Horev  algorithm  and  the  Miazza  architecture  are  detailed  and  explained. 

Chapter  III  details  how  the  optical  concepts  introduced  in  Chapter  II  are  used 
during  this  thesis  effort.  Experimental  setups  for  implementing  these  optical  con¬ 
cepts  are  shown  and  explained.  Additionally,  the  digital  simulations  performed  (o 
verify  the  optical  results  are  discussed. 


Chapter  IV  explains  and  analyzes  the  results  of  both  the  optical  experiments 
and  the  digital  simulations.  Photographs  of  both  />n  r  -  fg  coordinate  transforma¬ 
tions  and  correlation  peaks  are  presented. 


Finally,  Chapter  V  presents  pertinent  conclusions  based  on  the  available  data. 
In  addition,  recommendations  for  furthering  this  research,  including  suggested  ex¬ 
periments,  are  made. 
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II.  Basic  Concepts  and  Architectures 


2.1  Correlation 

One  concept  that  has  been  given  much  attention  in  developing  a  pattern  recog¬ 
nition  algorithm  is  correlation  [8].  Correlation  is  a  mathematical  comparison  of  two 
signals  (or  in  this  case  images)  that  measures  the  degree  of  similarity  between  the 
shape  of  one  image  to  another.  The  advantage  of  correlating  an  image  f(x,y)  with 
a  target  template  t(x,y)  is  that  the  result  will  be  a  distinguishable  autocorrelation 
peak  when  the  image  and  the  target  are  the  same,  and  an  equally  distinguishable 
crosscorrelation  peak  when  the  image  and  the  template  are  dissimilar.  Mathemati¬ 
cally,  correlation  can  be  described  by  [6:78] 

c{x,y)  =  /(a, /*)★<(<*,/?) 

-  f  I  f(a.fl)tm(a-x,j3-y)dadj  (1) 

J  —  oo  J  ~oo 

where  a  and  are  dummy  variables  of  integration,  the  asterick  (*)  means  complex 
conjugate,  the  star  (★)  is  the  mathematical  symbol  for  correlation,  and  c(.r,;y)  is  the 
correlation  result.  Alternatively,  correlation  can  be  expressed  in  terms  of  the  Fourier 
transform  as  follows[6:10] 

c(x,y)  =  T~'{FT'}  (2) 

where  Jr~l  is  the  symbol  for  an  inverse  Fourier  transform  operation  and  /•’  and  T 
are  the  Fourier  transforms  of  /(o,/3)  and  respectively.  Anot  her  advantage  of 

correlating  signals  is  that  an  autocorrelation  results  in  a  peak  positioned  relative  to 
the  location  of  the  target.  Furthermore,  correlators  will  detect  more  t  han  one  target 
in  a  scene  [8]. 

The  major  disadvantage  of  correlation  is  that  the  performance  is  severely  de¬ 
graded  by  both  scale  and  rotation  differences  between  the  template  and  the  target. 
Casasent  and  Psaltis  reported  that  the  signal-to-noise  ratio  (SNR)  of  a  correlation 
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Figure  1.  Casasent  and  Psaltis’  PSRI  Algorithm 

peak  would  drop  27  decibels  (dB)  when  the  scale  of  the  target  and  template  differed 
by  2  percent  or  when  the  target  was  rotated  3.5  degrees  with  respect  to  the  template 
[3:77]. 

2.2  PSRI  Feature  Space 

In  1978,  Casasent  and  Psaltis,  suggested  that  the  scene  space,  which  is  the 
domain  of  the  original  image,  be  transformed  into  a  feature  space  that  is  invariant 
to  changes  in  a  target’s  scale  and  rotation  [2:339-340].  In  such  a  feature  space,  a 
correlation  could  be  performed  without  loss  of  SNR  of  the  autocorrelation  peak.  The 
key  questions  here  are  whether  this  feature  space  exists,  and  how  much  information 
about  the  target  will  be  lost  when  a  target  is  transformed  into  this  feature  space? 

Casasent  and  Psaltis  proposed  a  feature  space  that  was  position,  scale,  and  ro¬ 
tation  (in-plane)  invariant  (PSRI).  To  derive  a  PSRI  feature  space,  their  algorithm 
uses  four  separate  transformations.  The  transformation  sequence  is  outlined  in  Fig¬ 
ure  I  [2:339-345],  The  first  step,  taking  the  Fourier  transform  (FT)  of  the  scene 
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Figure  2.  (a)  Input  Square,  (b)  167%  Scale  Change  and  45°  Rotation. 

space,  utilizes  the  shifting  property  of  the  Fourier  transform  [6:9]: 

f(x  ±  x0)  F(fx){exp(±j2irx0fx))  (3) 

This  property  states  that  the  Fourier  transform  of  a  shifted  signal  f{x)  is  equal  to 
the  Fourier  transform  of  f(x)  multiplied  by  a  linear  phase  term  [6:277],  Therefore, 
taking  the  magnitude  of  the  Fourier  transform  (  |  F(fx,  fy)  | )  eliminates  the  phase 
term  and  the  positional  dependence.  Optically,  this  means  that  the  Fourier  transform 
of  an  image  will  always  be  centered  on  the  optic  axis  of  the  Fourier  plane.  Then,  if 
only  intensity  (which  is  the  magnitude  squared)  is  considered,  the  Fourier  transform 
of  a  shifted  image  will  look  exactly  like  that  of  the  unshifted  image. 

The  second  step  in  the  algorithm  is  to  perform  a  polar  coordinate  transforma¬ 
tion  on  |  F(fxJy)  |  yielding  ]  F{fT,fg )  |  where  /r  represents  the  distance  in  the 
radial  direction  and  fg  represents  the  angle  in  the  rectangular  coordinate  system. 

This  transformation  converts  rotational  shifts  about  the  origin  of  the  rectangular 
frequency  domain  into  linear  shifts  along  the  fg  axis  of  the  polar  frequency  domain 
(see  Figures  2  and  3). 
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(a) 


Figure  3.  Theoretical  In  r  —  0  Coordinate  Transform  of  Squares  in  Figure  2  (a) 
and  (b).  Note:  The  transform  in  (b)  is  shifted  up  and  to  the  right. 

The  third  step  is  to  logarithmically  (In)  scale  the  radial  axis  of  the  polar 
transformation.  This  transformation  converts  scale  changes  in  the  rectangular  FT 
space  into  linear  shifts  of  |  F{fr,  fg)  |  along  the  radial  axis.  Consider  the  FT  scaling 
property  [6:9] 

f{x/a,y/a)  a2  F{afx,afy)  ^  a2F{afT,f8)  (1) 

where  a  is  a  scaling  constant.  Taking  the  natural  logarithm  of  the  radial  component 
yields  [9:9] 

ln(a/r)  =  ln(/r)  +  ln(a)  (5) 
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As  demonstrated  in  Figures  2  and  3,  the  logarithm  converts  the  scale  difference  in 
the  rectangular  domain  into  a  linear  shift  on  the  /in,.  axis. 

In  the  final  step  of  the  Casasent  algorithm,  the  FT  of  |  F(fin  r,  fg)  \  converts 
the  linear  shifts  along  the  r  and  fg  axes  into  phase  terms  that  can  be  eliminated 
with  an  intensity  detector  [2:342].  Performing  template  correlation  in  Casasent  and 
Psaltis’  PSRI  feature  space  will  identify  the  presence  of  a  target.  However,  it  will 
not  be  able  to  provide  the  parameters  necessary  for  target  acquisition:  position  in 
the  field  of  view,  scale,  and  rotation. 

2.3  Horev  Algorithm 

In  1980,  Horev  developed  a  pattern  recognition  algorithm  [7]  that  provided  the 
foundation  for  the  future  development  of  a  real-time  pattern  recognition  machine  at 
the  Air  Force  Institute  of  Technology  (AFIT).  A  flow  chart  of  the  algorithm  is  shown 
in  Figure  4.  The  algorithm  utilizes  the  previous  architecture  explored  by  Casasent 
and  Psaltis.  The  beauty  oe  his  design  is  two-fold.  First,  he  performs  his  correlation 
at  step  3  in  the  Casasent  and  Psaltis  algorithm  (see  Figure  1).  Performing  the  cor¬ 
relation  in  the  |  F(f\nT,fg)  |  feature  space  preserves  the  target’s  scale  and  rotation 
information.  Now  the  target’s  scale  and  rotation  can  be  specified  by  determining  the 
position  of  the  target-template  autocorrelation  peak  relative  to  a  predetermined  ref¬ 
erence  point.  The  scale  and  rotation  information  is  then  used  to  generate  a  rectified 
template  that  has  the  same  scale  and  rotation  as  the  target.  Horev  accomplished 
this  by  shifting  the  template  feature  space  by  an  appropriate  amount  along  the  r 
and  fg  axes. 

Second,  Horev  introduces  a  second  stage  that  extracts  the  phase  information 
from  the  FT  of  the  target.  The  phase  is  added  to  the  rectified  template  and  in¬ 
verse  Fourier  transformed  to  yield  a  rectified  target  image.  Since  the  phase  contains 
the  target’s  positional  information,  and  the  rectified  template  contains  only  the  fre¬ 
quency  energy  of  the  target,  the  target’s  scene  clutter  (noise)  will  be  reduced  and  the 
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Figure  4.  Horev  Algorithm.  Note:  The  addition  of  the  phase  extraction  and  rein¬ 
sertion  stages  allows  for  target  segmentation. 
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target  enhanced  [7,  9j.  Horev  uses  the  rectified  target  image  and  a  rectified  template 
to  perform  a  correlation  in  the  scene  space  to  determine  the  location  of  his  target. 
Since  the  template  has  been  adjusted  to  the  target’s  scale  and  rotation  (in-plane), 
the  algorithm  has  sought  to  maximize  the  SNR  of  the  autocorrelation  peak  [7:87-88]. 

Horev  digitally  implemented  his  algorithm  and  found  that  it  had  definite 
promise  in  high  clutter  rejection  [7:87],  However,  the  computer  implementation  took 
over  four  hours  to  process  one  scene  [7:89].  Kobel  and  Martin  conducted  a  follow-on 
study  with  Horev’s  algorithm  [9:9].  They  developed  another  digital  implementation 
that  reduced  the  processing  time  down  to  about  half  an  hour  per  scene  [8].  Then- 
found  that  the  algorithm  recognized  multiple  targets  in  a  broad  range  of  non-random, 
noise  corrupted  scenes  [9:78-79].  However,  the  problem  with  the  intense  computa¬ 
tions  associated  with  the  Fourier-transformed,  coordinate-transformed  correlations 
remained.  To  overcome  this,  the  feasibility  of  optical  implementation  needed  to  be 
investigated. 

2-4  Optical  Image  Processing 

Fourier  transforms,  coordinate  transformations,  and  correlation  schemes  have 
been  heavily  scrutinized  as  potential  pattern  recognition  tools  since  they  can  be 
implemented  optically  and  significantly  reduce  the  processing  time  [9],  Using  these 
tools,  the  goal  is  to  design  an  optical  pattern  recognition  system  that  can  provide 
parallel  processing  at  the  speed  of  light. 

The  Fourier  transformation  is  a  fundamental  building  block  in  optical  image 
processing.  Placing  an  image  in  the  input  focal  plane  of  a  lens  will  yield  the  Fourier 
transform  of  the  image  in  the  output  focal  plane  [6:83-90].  Thus,  the  first  step 
in  the  Kobel-Martin-llorev  algorithm  is  easily  accomplished  optically  by  sensing 
the  intensity  of  the  Fourier  transform  of  the  input  data.  The  Fourier  transform 
property  is  also  used  to  design  computer  generated  holograms  (CGII)  to  perform 
coordinate  transformations.  The  coordinate  transformation  is  developed  from  the 
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following  [1,  16] 

F{u,v)  =  f  f  f{x,y)exp(j<j>(x,y))exp[-j(2ir/XfL)(xu  +  yv)]dxdy  (6) 

J  —  oo  J  —  oo 

where 

F(u,v)  =  coordinate  transform  of  /(x,y) 

X  =  wavelength  of  light 

fL  =  focal  length  of  the  Fourier  transforming  lens 


This  equation  states  there  is  a  phase  function  exp[;0(x,y)]  that  can  be  multiplied 
by  an  image  /(x,y)  and  Fourier  transformed  to  yield  a  specified  coordinate  trans¬ 
formation. 

For  the  /|nr  —  fg  coordinate  transformation,  Eq  6  can  be  solved  for  o(.v.y) 
using  [1:939] 


u(x,y)  =  ln(x2  +  i/2)'/2 

(") 

v(x,y)  =  —  arctan(y/x) 

(8) 

and  the  approximate  saddle  point  integration  method.  This  method  yields  [16:9] 

<t>{x,y)  =  -  [x  ln(x2  +  r/2),/2  -  y  arctan(t//x)  -  x]  (9) 

A  CGH  is  a  binary  interferogram  of  the  interference  pattern  of  <j>{x.y)  and  a  plane 
wave  at  an  angle  0.  The  lines  of  the  CGH  must  be  plotted  to  satisfy  [1:939] 

27tqx  -  d>(x,  y)  =  27rn  (10) 

where  n  is  the  integer  that  indicates  different  lines,  and  a  =  sin  0/X.  Once  the 
PSRI  feature  space  has  been  created,  the  optical  correlation  must  be  accomplished. 

Optical  correlation  is  accomplished  using  a  Vander  Lugt  filter  [6:171-183],  A 
Vander  Lugt  filter  has  a  transmittance  proportional  to  the  Fourier  transform  of 
the  template  that  is  amplitude  and  phase  modulated  with  a  high  frequency  carrier. 
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The  filter  is  produced  by  exposing  high  resolution  film  directly  with  the  Fourier 
transform  of  a  template,  while  simultaneously  exposing  the  film  at  an  angle  with 
a  coherent,  plane  wave.  When  the  developed  filter  is  illuminated  directly  with  the 
Fourier  transform  of  a  target  image,  the  output,  when  inverse  Fourier  transformed, 
will  be  the  correlation  of  the  target  with  the  template  traveling  away  from  the  filter 
plane  at  the  same  angle  the  filter  was  originally  exposed  with  the  plane  wave. 

2.5  Miazza  Architecture 

In  1988,  Miazza  introduced  a  design  for  a  real-time  coherent  optical  pat¬ 
tern  recognition  system  that  implements  the  Horev-Kobel- Martin  algorithm  [18], 
A  schematic  of  his  design  is  shown  in  Figure  5.  The  key  to  his  design  is  the  use  of 
spatial  light  modulators  (SLM),  optical  devices  that  control  the  output  intensity  of 
an  incident  light  pattern,  to  perform  computationally  intensive  image  processing  at 
real-time  rates. 

Miazza  uses  SLMs  to  perform  four  different  processes  in  his  system:  incoherent- 
to-coherent  light  (ITC)  conversion,  phase  extraction,  and  phase  insertion.  In  Figure 
5,  SLM  1  functions  as  an  ITC  converter  for  the  input  image.  This  allows  the  system 
flexibility  in  the  type  of  input  images  it  can  process.  SLM  2  and  SLM  4  both 
remove  the  phase  of  the  image  and  allow  the  magnitude  to  pass  through  t  he  system 
for  processing.  Phase  removal  is  very  important  since  the  In  r  —  0CGII  and  the 
Vander-Lugt  filter  F3  are  both  designed  to  process  only  the  magnitude  of  an  image. 
SLM  3  and  SLM  5  are  the  highlight  of  the  system  because  they  perform  a  very 
difficult  task  in  optics— the  phase  extraction  and  reinsertion  step  of  the  IIorcv-Kobel- 
Martin  algorithm  (see  Figure  4).  Miazza  proposes  that  a  SLM  be  built  (SLM  3)  with 
either  a  transmittance  or  a  reflectance  inversely  proportional  to  the  intensity  of  the 
input  image.  Therefore,  the  magnitude  of  the  output  image  will  be  unity,  and  the 
output  will  retain  the  phase  information  of  the  input.  A  separate  solution,  using 
nonlinear  photorefractive  crystals,  has  also  been  proposed  [19].  SLM  5  takes  the 
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phase  information  from  SLM  3  and  places  the  magnitude  information  of  the  rectified 
template  on  the  same  image. 

Miazza’s  design  is  dependent  on  spatial  light  modulators  that  are  still  being 
designed  and  tested.  Consequently,  only  a  portion  of  his  system  has  been  validated 
in  a  laboratory.  Casasent,  Saito,  and  Mayo  have  fabricated  and  tested  a  In  r  —  0 
CGH  interferogram  [1,  10,  12,  16].  However,  in  each  case  only  the  scene  space  was 
transformed  instead  of  testing  coordinate  transformation  of  the  position  invariant 
magnitude-squared  Fourier  transform  space. 

2.6  Summary 

Optical  techniques  may  be  the  key  in  the  development  of  a  real-time  pattern 
recognition  machine.  Pattern  recognition  algorithms  that  utilize  Fourier  Transforms 
are  ideal  for  optical  processing  since  they  can  be  implemented  with  one  lens.  Miazza 
has  designed  a  real-time  optical  pattern  recognizer  based  on  an  algorithm  developed 
by  Horev.  The  design  uses  spatial  light  modulators,  lenses,  and  a  computer  generated 
hologram.  These  techniques  need  to  be  tested  in  a  laboratory  environment. 
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III.  Extended  Analysis  of  Optical  Implementation  and  Digital 

Simulation 


3. 1  Introduction 

This  chapter  details  the  experiments  performed  and  analyses  made  while  im¬ 
plementing  portions  of  the  Kobel-Martin-Horev  (KMH)  algorithm  optically.  Addi¬ 
tionally,  a  section  of  the  chapter  is  dedicated  to  the  digital  simulations  that  wen' 
performed  in  parallel  with  the  optical  experiments  to  verify  results  obtained  optically. 

The  next  section  of  this  chapter  discusses  the  initial  imaging  experiments  re¬ 
quired  to  prepare  a  set  of  binary  templates  and  reproduceable  stored  images.  This 
is  followed  by  a  section  on  Fourier  transforming,  including  setups  used  and  prob¬ 
lems  encountered.  Next  is  a  section  on  the  computer  generated  hologram  (CGI1). 
detailing  how  it  was  used  to  produce  the  f\nr  —  f$  coordinate  transformation  and 
problems  associated  with  linearity  and  space-bandwidth  product  (SBVVP).  A  sec¬ 
tion  on  optical  correlations  follows.  In  this  section,  details  of  the  Vander  Lugt  filter 
and  thermoplastic  holography  are  given.  The  next  to  last  section  details  the  digital 
simulation,  and  the  last  section  summarizes  the  important  highlights  of  this  chapter. 

3.2  Defining  Images 

After  becoming  more  familiar  with  the  equipment  available  for  this  research, 
the  researchers’  first  order  of  business  was  to  decide  upon  a  set  of  images  to  use 
as  binary  templates  throughout  the  research.  The  set  agreed  upon  consisted  of  the 
following: 

1.  Simple  geometric  images  to  include  a  square,  an  edged  square,  and  a  triangle. 

2.  Geometric  combinations  to  include  a  square,  triangle,  and  circle;  and  an  edged 

square,  triangle,  and  circle. 
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3.  Letters  to  include  an  “A”  and  an  “ASL.” 

4.  Cutouts  of  tanks,  trucks,  and  F-15s. 

It  should  be  noted  that  each  of  the  images  listed  above  was  produced  in  at 
least  two  different  scales  and  could  be  rotated  and  translated  at  will  within  the 
optical  systems.  This  scale,  rotation,  and  translation  variation  allowed  this  research 
to  address  three  of  the  six  problem  areas  identified  by  Kobel  and  Martin  [9:8] . 

Simple  geometric  images  were  chosen  as  research  images  because  the  symmetry 
of  these  objects  allowed  us  to  predict,  ahead  of  time,  the  shape  of  their  Fourier 
transforms  and  coordinate  transforms.  Using  them  would  then  allow  us  to  have 
confidence  in  the  optical  implementation  if  observed  results  were  commensurate  with 
predicted  results.  Letters  were  chosen  because  much  pattern  recognition  work  has 
centered  around  recognizing  the  written  word.  Finally,  tanks,  trucks,  and  F  los  were 
used  because  segmented  images  were  readily  available,  and  the  military  has  a  need 
to  be  able  to  identify  them  as  potential  targets. 

Once  the  desired  images  had  been  identified,  the  templates  had  to  be  pro¬ 
duced.  An  initial  attempt  was  made  at  photographing  (using  a  35mm  camera)  a 
white  cutout  on  black  background  using  instant  slide  film.  However,  results  were 
less  than  desirable  (see  Appendix  A).  To  get  high  quality,  binary  templates,  high 
resolution  glass  photographic  plates  were  used.  Details  of  how  the  binary  templates 
were  produced  on  the  glass  plates  can  be  found  in  Appendix  A. 

With  a  set  of  high  quality,  binary  templates  now  in  hand,  research  could  pre¬ 
cede.  Knowing  that  digital  manipulations  would  be  performed  on  the  template's,  a 
means  of  transferring  them  to  the  VAX  I780A  had  to  be  devised.  The  first  step 
was  to  get  the  images  stored  on  floppy  diskettes.  The  optical  setup  shown  in  f  ig¬ 
ure  6  is  the  setup  used  to  store  the  images.  The  two  lens  imaging  system  was  set 
up  for  an  image  reduction  of  2.5  times.  This  allowed  the  entire  image  of  even  the 
largest  template  to  be  stored  successfully.  Filter  Fl  was  actually  as  many  neutral 
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Figure  6.  Setup  for  imaging  binary  templates  onto  CCD  camera.  Note:  Used  to 
store  images  on  disk  at  2.5x  reduction. 

density  filters  as  necessary  to  reduce  background  illumination  to  provide  as  binary 
an  image  as  possible.  Templates  were  placed  in  a  filter  holder  and  illuminated  with 
the  coherent  laser  light  source.  The  CCD  camera,  driven  by  the  AT&T  TARGA 
framegrabber,  then  grabbed  the  images.  Once  grabbed  into  the  TARGA’s  memory, 
the  images  could  be  transferred  to  floppy  disk.  The  images  were  then  transferred  to 
the  VAX  using  the  file  transfer  protocol  (ftp)  routine.  This  procedure  is  explained  in 
detail  in  Appendix  D.  Figure  7  shows  a  photograph  of  the  same  small  square  shown 
above  after  it  was  saved  on  disk  and  transferred  to  the  VAX. 

The  next  step  of  the  KMH  algorithm  called  for  taking  the  magnitude  of  tin' 
Fourier  transform  (]FT|)  of  the  images.  Optical  implementation  of  this  step  is 


Figure  7.  Photograph  of  small  square  stored  on  diskette.  Note:  Taken  from  display 
of  a  microVAX  workstation. 

detailed  in  the  following  section. 

3.3  Fourier  Transforms 

3.3.1  Background  The  very  essense  of  any  optical  lens  system  is  the  inherent 
ability  of  a  positive  (converging)  lens  to  produce  the  Fourier  transform  of  a  coherent 
input.  Therefore,  it  is  no  surprise  that  the  initial  image  transformation  in  this 
thesis  work,  as  it  was  in  the  KMH  algorithm,  is  the  Fourier  transform  (  FI'  ) 
of  the  input  image  scene.  While  this  step  alone  is  the  limiting  time  factor  that 
hinders  the  KMH  algorithm  from  being  implemented  in  near  real-time,  it  can  be 
realized  instantaneously  in  an  optical  system.  This  is  a  very  distinct  advantage  ol 
implementing  the  KMH  algorithm  optically. 


Figure  8.  Schematic  of  Optical  Fourier  Transform  Setup  [6:87]  . 

Kobel  and  Martin  used  a  two-dimensional,  fast  Fourier  transform  algorithm 
(  2DFFT  )  for  computing  their  initial  FT  .  This  method  took  approximately  a 
minute  to  calculate  the  FT  of  a  256  by  256  pixel  image  array  using  AFIT's  1780a 
VAX.  With  the  typical  optical  setup  shown  in  Figure  8,  FT s  are  generated  at 
the  speed  of  light.  When  image  t(x,,t/,)  is  placed  at  a  distance  d0  in  front  of  a 
converging  lens,  the  output  image  Ff(xf,yf)  located  at  the  lens’  focal  plane  (one 
focal  length  behind  the  lens)  has  a  direct  relationship  to  the  FT  of  the  input  image. 
This  relationship  is  described  by  the  following  equation  [6:86] : 

Aexp[;jj(i  -  if){ x )  +  t /})] 

j2\f 

no  roo  2  7T 

/  t(x„  y,)  exp[-;  —  {xiXf  +  y,yf  )]dx0dij0  ( 1  !  ) 
-oo  J  —  oo  A  J 

where 

/  =  focal  length  of  the  converging  lens 
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A  =  wavelength  of  coherent  light 

d0=  distance  of  input  in  front  of  the  lens 

k  =  wave  number  (^) 

A  =  amplitude  constant  for  light  source 

This  equation  states  that  the  amplitude  and  phase  at  the  focal  plane  (x/,j//)  are 
related  to  the  amplitude  and  phase  of  the  frequency  spectrum  of  t(x,,yt)  at  the 
frequency  (.t//A/, yj/^f)  [6:86].  The  output  Ufix/,yf)  is  actually  the  amplitude- 
scaled  FT  of  the  image  multiplied  by  a  phase  function  4>  .  Therefore, 

(//<*/.»/)  =  nfi.fi)*  (i2) 


where 


T(^,  rfj)  =  FT  of  the  input  image 


$  = 


J2A  / 


Notice  that  when  da  =  /,  $  is  equal  to  a  constant  A/ j2\f  and  (  j(xj,yj)  is  an 
exact  FT  of  t(x0,y0 )  scaled  by  the  constant  A/j2\f  . 

3.3.2  Analysis  In  this  research,  a  helium  neon  (HeNe)  laser  was  used  as  the 
coherent  light  source,  and  a  300  mm  focal  length,  /  ,  converging  lens  was  used  as 
the  Fourier  transforming  lens.  Since  the  computer  generated  hologram  (CGH)  used 
in  the  second  transformation  of  the  AFIT  algorithm  can  transform  only  real  inputs, 
the  phase  information  must  be  removed  from  the  Fourier  transformed  image.  1  his 
can  be  accomplished  by  one  of  two  methods:  first,  using  a  charged  coupled  device 
(CCD)  camera  with  a  frainegrabber  board;  and  second,  recording  the  image  directly 
on  high  resolution,  black  and  white  35  mm  film. 

Of  the  two  techniques,  using  the  CCD  camera  with  a  framegrabber  is  the  only 
technique  usable  in  a  real  time  optical  system.  Actually,  to  build  a  continuous  system 
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that  detects  the  input  image’s  intensity  and  retransmits  the  intensity  distribution 
without  its  phase  would  also  require  a  spatial  light  modulator  (SLM).  Miazza  pro¬ 
posed  that  a  grey  level  sensitive  optically  addressed  SLM  be  used  for  real  time  phase 
removal  [13].  Since  this  type  of  SLM  was  not  available  for  study,  we  used  a  512  by 
512  element  Sony  model  XC-38  CCD  camera  and  a  AT&T  TARGA  framegrabber 
board  installed  in  a  Zenith  248  (IBM  AT  compatable)  personal  computer.  The  CCD 
camera  detected  intensity  of  the  image  signal.  The  detected  image  intensity  was  then 
digitized  and  stored  in  memory  by  the  TARGA  board.  To  retransmit  the  image  in¬ 
tensity,  we  used  a  Semetex  128  by  128  pixel  Sight  Mod.  This  is  a  magneto-optical 
bin  ry  SLM.  Since  this  device  was  binary,  that  is,  no  grey  scale,  it  limited  the  use  of 
this  technique  to  showing  that  image  intensities  can  be  detected  and  retransmitted 
at  near  real  time.  For  further  discussion  of  the  Semetex  Sight  Mod  sex?  Appendix  C. 

Mayo  discovered  that  the  CCD  camera  was  very  sensitive  to  high  intensity 
peaks  like  the  dc  (or  center)  of  a  FT  [12].  When  the  CCD  array  was  exposed  to 
a  high  intensity  peak,  the  camera  exhibited  a  “blooming”  phenomena  where  a  peak 
that  is  actually  only  a  few  pixels  wide  appears  as  a  bright  cross  pattern  centered  at  the 
peak  with  its  legs  running  parallel  to  the  rows  and  columns  of  the  pixel  array.  Since 
the  dc  represents  the  average  intensity  of  the  original  image,  its  information  content 
is  considered  insignificant  to  pattern  recognition.  Therefore,  this  dc  portion  could 
be  eliminated  without  detracting  from  the  research.  To  do  this,  a  high-pass  filter 
(IIPF)  was  generated  using  high  resolution  reduction  photography  (see  Appendix 
A).  The  filter  consisted  of  a  circle  200  nm  in  diameter  with  a  uniform  attentuation 
of  approximately  2000  times.  A  comparison  of  a  FT  of  a  square  saved  with  and 
without  the  IIPF  is  shown  in  Figure  9.  It  can  be  seen  in  Figure  9(b)  that  using  the 
HPF  enabled  the  CCD  camera  to  resolve  more  diffracted  orders  of  the  FT  image 
and  eliminated  the  “blooming”  that  is  evident  near  the  center  of  Figure  9(a). 

The  schematic  for  taking  the  high  pass  filtered  FT  is  shown  in  Figure  10. 
Mayo  filtered  his  FT  image  by  placing  the  HPF  against  the  camera  opening.  Fig- 
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ure  10(a)  shows  that  the  CCD  array  was  located  in  the  focal  plane  of  the  Fourier 
transforming  lens,  and  due  to  the  camera  body,  the  HPF  was  actually  placed  ap¬ 
proximately  7-10  mm  in  front  of  the  focal  plane.  This  configuration  made  the  filter 
difficult  to  center,  and  since  the  HPF  was  not  in  the  focal  plane  of  the  FT  lens, 
the  desired  blocking  affect  was  not  attained.  A  more  accurate  method  of  filtering  is 
shown  in  Figure  10(b).  The  HPF  was  placed  in  the  focal  plane  of  the  FT  lens  (L  \ ) 
and  the  focal  plane  of  L\  was  imaged  onto  the  CCD  camera  using  a  second  lens  L 2. 

Two  techniques  were  discovered  that  were  very  helpful  in  ensuring  that  the 
FT  was  properly  setup.  First,  to  ensure  that  the  CCD  camera  was  in  the  imaging 
plane  of  L2  ,  a  multiple  geometric  object  template  was  used  as  an  input  image.  Due 
to  the  FT  shifting  property  (see  Chapter  2),  objects  located  at  different  positions 
will  still  have  dc  terms  located  at  the  exact  same  position  in  the  Fourier  plane. 
Therefore,  the  distance  between  the  imaging  lens  L2  and  the  CCD  camera  can  be 
adjusted  until  the  dc  terms  overlap.  Second,  to  ensure  that  the  HPF  was  located 
precisely  in  the  focal  plane  of  L\  ,  the  FT  shift  theorem  was  again  exploited. 
Movement  of  a  single  object  in  the  input  plane  will  not  change  the  location  of  the 
FT  with  respect  to  the  HPF.  Therefore,  the  distance  of  the  HPF  with  respect  to 
L i  was  adjusted  until  the  dc  and  the  HPF  remained  in  the  same  location  when  the 
input  object  is  moved.  With  these  two  techniques  properly  executed,  the  system  was 
aligned  and  ready  to  record  Fourier  transforms. 

Since  a  grey  scale  capable  SLM  was  not  available,  we  recorded  FT  image's 
directly  on  35  mm  high  contrast  slide  film.  Polaroid’s  HC-135  instant  slide  film  w as- 
found  to  give  the  best  results.  To  make  the  35  mm  slides,  a  moveable  mirror  was 
placed  between  L2  and  the  CCD  camera  (see  Figure  10(b)).  This  mirror  allowed 
switching  between  the  CCD  and  the  35  mm  camera.  The  mirror  diverts  the  FT 
image  to  a  single  lens  reflex  (SLR)  camera  body  (with  no  camera  lens).  1'he  average 
power  of  the  FT  image  at  the  SLR  camera  was  measured  to  ensure  that  it  was  at 
eye  safe  levels.  A  half  a  milliwatt  or  less  is  considered  eye  safe.  The  level  used  in  this 
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Figure  10.  Optical  Fourier  transforming  setup  where  (a)  was  Mayo’s  method  and 
(b)  was  used  in  this  research.  Note:  The  HPF  (dc-block)  is  in  the  back 
focal  plane  of  Ly  to  remove  low  frequency  components. 
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setup  was  in  the  tens  of  microwatts.  While  sighting  through  the  camera’s  viewfinder, 
the  multiple  object  technique  mentioned  above  was  used  to  focus  the  |FT|2  onto  the 
film  plane  of  the  camera.  The  SLR  camera’s  automatic  mode  was  used  only  to 
estimate  the  shutter  speed  needed  for  a  correct  exposure  since  the  automatic  shutter 
tended  to  produce  overexposed  slides.  Therefore,  the  camera’s  manual  mode  was 
used,  bracketing  two  or  three  shutter  speeds  above  the  automatic  shutter  setting. 

As  with  the  binary  templates,  the  Fourier  transforms  had  to  be  digitally  saved 
for  later  processing  on  the  VAX.  This  was  accomplished  by  turning  the  mirror  shown 
in  Figure  10(b)  so  the  FT  was  incident  on  the  CCD  camera  array  and  then  “grab¬ 
bing”  the  image  in  an  identical  manner  as  explained  earlier. 

The  next  step  in  the  optical  implementation  is  transforming  the  recorded 
Fourier  transforms  to  the  f\nT  —  fg  coordinate  transform  space.  This  transfor¬ 
mation  is  the  subject  of  the  following  section. 

3.4  Polar- Logarithmic  Coordinate  Transformation  Using  a  CGH 

3. 4-1  Introduction  This  section  discusses  the  optical  implementation  of  a 
computer  generated  interferogram  as  performed  in  this  thesis  research.  The*  ('(III 
was  used  to  coordinate  transform  the  Fourier  transform  intensity  patterns  discussed 
in  the  previous  section  into  f\n  r  —  fg  feature  spaces.  Detailed  procedures  and  mathe¬ 
matical  formulations  used  to  solve  for  the  interferogram  and  to  produce  the  optically 
implemented  version  of  the  CGH  can  be  found  in  Appendices  A  and  B. 

3.4.2  Optical  Implementation  The  researchers’  first  attempt  at  optically  im¬ 
plementing  the  /|n  r  —  fg  feature  space  was  modeled  after  the  setup  used  by  Mayo 
as  shown  in  Figure  11  [12:22].  The  input  image  (|FT[2)  was  placed  directly  against 
the  CGH  “in”  plane  Pt.  Obviously,  this  is  impossible.  Since  both  the  CGH  and  the 
input  image  (  a  Polaroid  slide  with  holder)  have  physical  dimensions,  one  cannot 
overlap  them  and  still  have  both  in  the  exact  same  plane.  Additionally,  since  the 
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center  of  the  |FTj2  must  be  aligned  precisely  with  the  center  of  the  CGH  for  a  sym¬ 
metrical  transformation,  butting  the  CGH  and  input  up  against  one  another  failed 
due  to  the  lack  of  an  accurate  alignment  procedure. 

To  overcome  the  in-the-same- plane  problem,  the  |FTj2  needed  to  be  imaged 
onto  the  CGH.  To  overcome  the  alignment  problem,  the  |FT|2  slide  needed  to  be 
mounted  in  an  xyz- adjustable  holder.  The  schematic  shown  in  Figure  12  details  the 
optical  setup  used  to  solve  the  problems  outlined  above. 

The  two-lens  imaging  system  comprised  of  L\  and  L2  accomplished  three  es¬ 
sential  functions  in  this  setup.  First,  L\  and  L2  were  separated  by  a  distance  equal  to 
f\  +  f2-  This  produced  the  plane  wave  coming  out  of  L2  that  was  required  to  illumi¬ 
nate  the  CGH,  since  only  a  real  input  could  be  coordinate  transformed.  Second,  the 
ratio  of  f2  to  fi  (750mm  to  250mm)  gave  an  effective  magnification  of  three  times. 
This  magnification  was  required  to  overcome  a  space-bandwidth  product  (resolution) 
problem  which  will  be  discussed  in  more  detail  later.  Finally,  L\  and  L2  provided 
the  means  of  imaging  the  input  directly  onto  the  plane  of  the  CGII. 

With  the  |FT|2  imaged  onto  the  CGH,  the  combination  of  the  binary  interfer¬ 
ence  pattern  of  the  CGH  and  the  Fourier  transforming  lens  Z.3  performs  the  coordi¬ 
nate  transform  function.  The  f\nT  —  fe  CT  can  then  be  viewed  at  the  back  focal 
plane  (/3  =  300mm)  of  L3.  To  view  the  CTs,  the  CCD  camera  was  again  used. 
Figure  13  shows  a  photograph  of  an  optically  produced  CT  of  a  small  square. 

S.j.S  Alignment  and  Focusing  As  mentioned  in  the  previous  section,  aligning 
the  input  image's  center  with  the  center  of  the  CGH  was  a  problem.  The  following 
procedure  was  used  to  solve  this  problem.  With  the  input  (35mm  slide  mounted  in 
slide  holder)  inserted  in  a  Newport  Corporation  LP-2  lens  holder,  which  gives  .n/r 
translation  adjustment,  the  CT  was  viewed  in  real-time  on  a  monochrome  computer 
monitor.  The  AT&T  TARGA  framegrabber  was  again  being  used  to  grab  and  display 
the  image  incident  on  the  CCD  pixel  array.  As  the  xy  adjustments  of  the  LP-2  were 
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Figure  11.  Mayo’s  /|n  T  —  fo  feature  space  setup.  Note  that  the  input  is  hutted 
against  the  CCJII  [12:22]. 
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igure  12.  Optical  setup  for  /|n  r  -  fg  CT.  Note  the  addition  of  the  imaging 
L\  and  L2  «vs  ari  improvement  over  Mayo’s  design. 
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Figure  13.  Photograph  of  the  In  r  —  0  CT  of  a  small  square.  Note  the  symmetry 
and  similarity  in  the  heights  of  the  peaks. 

varied,  the  image  of  the  CT  was  observed  to  drastically  change  shape.  If  the  input 
image  was  too  high  with  respect  to  the  center-to-center  alignment,  the  CT  would  fold 
upward.  Similarly,  when  the  input  image  was  too  low,  the  CT  would  fold  downward. 
Finally,  a  left  or  right  shift  off  of  the  center-to-center  alignment  would  twist  the  CT 
as  if  it  were  being  modulated  by  a  sine  wave.  Figures  14  and  15  show  photographs  of 
the  out-of-alignment  patterns  just  discussed,  as  well  as,  a  photograph  of  a  correctly 
aligned  CT. 

Focusing  the  coordinate  transforming  system  was  a  two-step  process  in  itself. 
In  the  first  step,  the  CCD  camera  was  positioned  in  place  of  the  CCII  and  used  to 
view  the  magnified  image  of  the  |FT|2  input.  The  z-axis  adjust  merit  of  the  LP-2  lens 
holder  was  then  varied  until  a  sharply  focused  image  was  observed  on  the  monitor. 
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In  the  second  step,  the  CGH  was  returned  to  its  initial  position,  the  input  slide  was 
removed  from  LP-2,  and  the  CCD  was  returned  to  the  back  focal  plane  of  £3.  Now, 
to  ensure  that  the  CCD  was  in  the  focal  plane  of  F3  (which  meant  that  the  CT 
was  in  focus),  the  z-axis  position  of  the  CCD  camera  was  varied  while  the  monitor 
was  observed.  Since  the  input  had  been  removed,  the  CGH  (which  is  square)  was 
being  completely  illuminated  and  the  CT  observed  was  that  of  a  square.  When  the 
square’s  CT  was  in  focus,  all  of  its  peaks  were  of  equal  height  (see  Figure  13).  II  the 
CCD  is  out  of  the  focal  plane,  the  heights  of  the  peaks  will  not  be  uniform.  Figure 
16  shows  photographs  of  out-of-focus  CTs  of  a  square. 

3-4-4  Space-Bandwidth  Product  As  mentioned  earlier,  the  FTs  had  to  be 
magnified  by  a  factor  of  three  to  overcome  a  SBWP  problem.  The  initial  two-lens 
imaging  setup  attempted  used  two  125mm  focal  length  lenses.  This  provided  the  nec¬ 
essary  plane  wave  generation,  but  only  accomplished  a  one-to-one  imaging  onto  tin' 
CGH.  Thus,  because  the  \FT\ 2  on  the  Polaroid  slide  film  was  small  (approximately 
2-3  mm),  the  image  on  the  CGH  was  small.  Apparently,  because  of  the  SBWP 
associated  with  the  interferogram  (35  lines/mm,  10mm  x  10mm),  sufficient  energy 
from  the  |FT|2  was  not  being  diffracted  into  the  f\n  r  —  fe  CT  feature  space.  Similar 
results  were  being  observed  with  digital  simulations  and  it  was  not  until  the  |/’7  |" 
was  magnified  that  the  expected  CT  was  observed.  A  magnification  factor  of  three 
was  chosen  because  it  provided  a  good  CT  without  causing  too  many  of  the  lower 
orders  of  the  FT  to  be  cut  off  by  the  physical  extent  of  the  CGH.  Figures  17  and 
18  are  a  series  of  four  photographs  which  explicitly  show  the  SBWP  problem.  I  he 
photographs  show  the  CTs  of  circular  apertures  of  varying  sizes  that  were  imaged 
onto  the  CGH.  It  can  be  seen  that  the  resolution  degrades  as  the  aperture  size  is 
reduced.  At  an  aperture  size  of  one  millimeter,  the  CT  has  completely  deteriorated. 

3-4-5  Linearity  One  last  aspect  of  the  CGH  that  was  explored  was  its  linear¬ 
ity  with  respect  to  scale  and  rotation  changes.  Rotation  linearity  was  inherent  and 
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Figure  16.  Out-of-focus  CTs  of  a  square  (a)  pre-focal  plane,  (b)  post-focal  plane. 

Note  the  higher  inner  peaks  in  (a)  and  the  higher  outer  peaks  in  (b). 
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Figure  17.  CTs  of  circular  apertures  where  (a)  is  of  an  8mm  aperture  ami  (b)  i*  oi 
a  4mm  aperture.  Note  how  the  CT  has  shifted  down  in  (b). 
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Figure  18.  CTs  of  circular  apertures  where  (a)  is  of  a  2mm  aperture  and  (b)  is  of 
a  1mm  aperture.  Note  the  deterioration  in  these  CTs. 
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Figure  19.  CT  of  square  rotated  45  degrees.  Note  linear  shift  along  horizontal  (Jo) 
axis  with  respect  to  Figure  13. 

obvious  (see  Figure  19),  but  scale-change  linearity  needed  to  be  verified.  To  do  t  his, 
the  optical  setup  shown  in  Figure  20  was  used  to  test  the  CGH’s  response  to  various 
sized  circular  apertures. 

A  mechanical  iris  was  used  as  a  variable  aperture  source.  The  diameter  of 
the  aperture  was  varied  from  one  millimeter  to  nine  millimeters  in  one-millimeter 
increments.  At  each  aperture  setting,  the  location  of  the  CT  (a  horizontal  "line  lor 
a  circle)  on  the  monitor  was  marked  and  recorded.  Results  of  these  measurements 
are  recorded  in  Figure  21(a)  and  shown  on  the  graph  in  Figure  21(b).  when'  the 
CGII  shift  is  shown  to  be  nearly  linear  for  aperture  sizes  of  four  millimeters  and 
above.  The  theoretical  shift  values  shown  in  Figure  21(a)  are  based  on  (he  ln(«) 
shift  expected  from  Equation  5  and  referenced  to  two  millimeters.  No  value  for  a 
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Figure  21.  Table  and  Graph  of  aperture  size  versus  distance  from  zero  reference. 

(a)  Table  shows  maximum  error  of  5.1%,  (b)  Graph  shows  linear  re¬ 
gion  above  4mm.  Note:  The  dotted  line  extrapolation  shows  t  he  linear 
portion  of  the  In  curve. 
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one  millimeter  aperture  was  possible  because  the  CT  had  deteriorated.  Thus,  the 
two  millimeter  reading  was  used  as  the  reference  for  relative  shift.  Notice  that  the 
largest  percent  error  was  only  5.1%,  showing  that  the  CGH  does,  in  fact,  give  the 
expected  In  (a)  shift. 

3.4-6  Data  Storage  As  with  the  FT ,  two  types  of  data  needed  to  be  recorded. 
First,  the  CCD  camera,  with  the  framegrabber,  was  used  to  save  images  to  disk  in 
the  same  manner  as  discussed  earlier.  Second,  when  mirror  M2  (see  Figure  12  was 
rotated  into  the  optical  pathway,  the  CT  was  focused  onto  the  filmplane  of  the 
35mm  camera.  Focusing  was  accomplished  by  viewing  the  CT  of  a  square  through 
the  camera’s  viewfinder,  and  then  translating  the  camera  until  the  peaks  of  the  C'T 
were  of  uniform  height.  The  Polaroid  HC-135  instant  slide  was  again  used  to  record 
the  CTs. 

Once  all  of  the  CTs  were  recorded,  selected  slides  of  the  | /i„  r  —  fg  |2  feature 
space  were  used  to  optically  generate  Vander  Lugt  filters  for  correlation  experiments. 
These  procedures  are  the  topic  of  the  next  section  of  this  chapter. 

3.5  Correlating  The  Feature  Space 

3.5. 1  Vander  Lugt  Filtering  In  1963,  A.  B.  Vander  Lugt  proposed  and  demon¬ 
strated  a  coherent  light  technique  for  frequency- plane  matched  filtering.  The  beauty 
of  his  technique  is  that  both  the  amplitude  and  the  phase  of  a  transfer  function  can 
be  recorded  on  film  as  an  amplitude  function  (or  interferogram).  Furthermore,  this 
spatial  frequency  information  can  be  physically  compared  to  the  spatial  frequency 
information  of  an  input  image.  Why  is  this  technique  so  significant?  In  the  spatial 
frequency  domain,  both  amplitude  and  phase  information  arc  needed  to  completely 
characterize  a  signal.  The  amplitude  frequency  spectrum  only  describes  part  of  the 
frequency  response  of  a  system.  It  is  the  the  phase  information  that  determines  the 
position  of  these  spatial  frequencies  in  the  impulse  response  of  the  optical  system. 
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Figure  22.  Optical  Schematic  for  generating  Vander  Lugt  filter  [6:172]  . 

Figure  22  shows  a  typical  schematic  for  generating  a  Vander  Lugt  filter.  A 
transparency  of  a  desired  impulse  response  ( h )  for  an  optical  system  is  illuminated 
with  a  collimated,  coherent  light  source  at  plane  P\  .  Lens  L2  ,  located  one  focal 
length  (/)  in  front  of  the  transparency,  Fourier  transforms  the  transparency  with 
the  resultant  scaled  transfer  function  (djH(^-,  ^r))  (note  bold  faced  type  indicates 
complex  function)  focused  on  a  high  resolution  photographic  plate  at  P2  ■  While 
the  film  is  exposed  with  the  transfer  function,  the  film  is  also  being  exposed  with 
a  plane  wave  at  an  angle  0  with  respect  to  the  path  of  the  illuminated  impulse 
response.  This  plane  wave  Ur  is  commonly  called  the  “reference  beam”,  whereas 
the  illumination  of  the  transfer  function  is  called  the  “object  beam”.  Note  that  the 
path  length  difference  between  the  object  and  reference  beams  must  be  within  the 
coherence  length  of  the  light  source  to  ensure  an  interference  pattern  between  those 
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beams  is  generated  on  the  film  plane. 


The  reference  beam  is  described  by 

Ur(x2,y2)  =  r0exp( - - - )  (13) 

where  r0  is  the  amplitude  of  the  reference  beam  and  the  exponential  term  is  the 
phase  difference  along  the  reference  beam  at  the  film  plane  P2  due  to  the  angle  of 
incidence  9 .  The  resultant  intensity  distribution  (I)  at  P2  is  equal  to  the  square 
of  the  sum  of  the  amplitudes  of  the  reference  and  object  beams: 

Hx2,y2)  =  t0  exp( - - - 

=  rl  +  +  JjHcxpijixaij,) 

+  ^jH*exp(-j27TQy2)  (11) 

where  a  =  sin  0/ X. 

In  order  to  demonstrate  the  amplitude  and  phase  retention  of  the  Vander 
Lugt  interference  technique,  the  transfer  function’s  wavefront  can  be  rewritten  as  an 
amplitude  function  multiplied  by  a  phase  function: 


*2  y2 


*2  y  2 


H(^,f7)  =  ^T7)e*P  N0 


A/’ A/ 


A/’ A/ 


X7  V2 
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Substituting  Equation  15  into  Equation  14  yields  a  very  descriptive  equation  for  the 
intensity  distribution  of  the  interference  pattern  at  the  film  plane: 


>(x2,y2)  =  rj  +  {w  fj 

~r°  a  ( x2  y2\___ 


A  /  VA/’A/ 


+  T7A  T7 ’  T7  cos  27ra^  -  *  T7’ 
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This  equation  shows  that  the  interference  pattern  is  the  transfer  function's  amplit  tide 
and  phase  functions  modulated  onto  a  high  frequency  carrier. 

Exposing  this  interference  pattern  to  a  photographic  plate  yields  a  trans¬ 
parency  with  an  amplitude  transmittance  function  t(x2,y2)  with  a  distribution 
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Figure  23.  Optical  schematic  to  generate  optical  correlation  [6:  i 67]  . 

proportional  to  intensity  functions  listed  in  Equations  14  and  16.  By  next  remov¬ 
ing  the  reference  beam  and  the  impulse  response  transparency  at  Px  ,  this  optical 
processing  schematic  can  be  modified  to  perform  signal  processing.  The  post-filter 
generation  optical  schematic  is  shown  in  Figure  23.  In  Figure  23,  the  optical  sys¬ 
tem’s  impulse  response  transparency  is  replaced  with  a  transparency  of  an  input 
signal  g(xx,yx)  to  be  processed  (filtered)  by  the  system.  In  addition,  a  Fourier 
transforming  lens  L3  is  added  to  the  output  of  the  Vander  Lugt  filter  to  convert 
the  processed  signal  back  to  the  space  domain. 

The  input  signal  <j{x\,y\)  is  processed  in  the  following  manner.  First.,  the 
input  signal  is  Fourier  tranformed  onto  the  Vander  Lugt  filter  at  I\  .  This  is 
equivalent  to  multipling  the  input  signal’s  frequency  information  with  the  amplitude 
distribution  of  the  filter.  Therefore,  the  resultant  field  U2  transmitted  through  the 
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filter  is 
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Next,  U2  is  Fourier  transformed  with  a  resultant  field 


U3(x3,y3)  oc  r2g(x3,j/3)  +  2/3)  *h*(-x3,-y3)  *g(^3,2/3)] 

V 

+  ^y[h(x3,  y3)  *  g(x3,  t/3)  *  6{x3,  ?/3  +  oA/)] 

T 

+T7[h*(-X3’  ~y^  *  g(X3’ ^ x 3’ 2/3  ~  aA/)] 
AJ 


(18) 


The  third  and  fourth  terms  of  Equation  18  are  the  most  important  resuits  of  this 
optical  system.  The  third  term  yields  the  complex  convolution  of  the  input  image 
with  the  desired  impulse  response  centered  at  coordinates  (0,  —aXf)  in  plane  P3  . 
The  fourth  term  yields  the  crosscorrelation  between  the  input  image  anil  the  impulse 
response  centered  at  the  coordinates  (0,  a\f)  in  P3  . 

The  Vander  Lugt  filter’s  inherent  ability  to  generate  two-dimensional  cross  cor¬ 
relations  at  light  speed  indicates  that  this  technique  may  prove  very  useful  in  Inline 
attempts  at  building  a  near-real-time  pattern  recognition  machine.  For  crosscorre¬ 
lations  between  the  feature  spaces  of  a  target  template  and  an  image,  the  template 
feature  space  is  the  desired  impulse  response  and  the  image  feature  space  is  the  in¬ 
put  signal  to  be  processed.  For  this  thesis,  Vander  Lugt  filters  were  generated  for 
the  Fourier  transform  /inr  —  fo  feature  space  of  target  templates  and  then  used  to 
correlate  the  feature  spaces  of  test  images.  To  create  Vander  Lugt  filters  optically, 
holographic  techniques  requiring  chemical  developing,  fixing,  and  bleaching  are  nor¬ 
mally  used.  In  this  thesis,  however,  a  thermoplastic  holographic  medium  was  used 
to  make  the  filters. 


3.5.2  Thermoplastic  Holograms  Several  types  of  media  are  available  for  record¬ 
ing  Vander  Lugt  filter  interference  patterns.  The  most  common  filter  media  are  the 
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Figure  24.  Cross  section  of  thermoplastic  film.  Note:  The  thermoplastic  layer  is 
only  0.7 pm  thick  [15]  . 

holograhic  and  high  resolution  photograhic  films.  But  these  materials  require  wet 
chemical  processing  for  development.  This  research  was  conducted  using  a  Newport 
HC-300  thermoplastic  holographic  recording  device.  This  device  is  convenient  in  that 
it  can  generate  a  hologram,  or  for  this  research  a  matched-filter,  in  approximately 
60  seconds  without  wet  chemical  processing.  Additionally,  the  film  is  reuseable  up 
to  1000  times.  The  HC-300  consists  of  a  camera  for  holding  and  processing  the  ther¬ 
moplastic  media,  a  shutter  system,  a  power  sensor  for  automatic  shuttering,  and  a 
system  controlling  unit. 

The  thermoplastic  film  (plate)  is  a  multilayered  media  that  has  been  deposited 
on  a  quartz  substrate  (see  Figure  24).  The  film  consists  of  three  layers:  thermoplas¬ 
tic,  photoconductor,  and  transparent  electrical  conductor.  Generating  a  thermoplas¬ 
tic  hologram  is  a  four  step  process  (see  Figure  25). 

First,  a  uniform  charge  is  deposited  on  the  film  using  a  moveable  coronation 
located  inside  the  camera.  As  shown  in  Figure  26, the  coronatron  consists  of  a  wire 
anode  located  on  the  central  axis  of  a  slotted  cylinder.  The  anode  voltage  is  raised 
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Heating  of  the  thermoplastic  causes  permanent  deformation.  The 
irregularities  will  diffract  light  to  recreate  the  original  image. 
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Figure  26.  Coronatron  charging  a  thermoplastic  plate  [15]  . 

to  4  kilovolts  while  the  cylinder  and  the  thermoplastic  plate  are  grounded.  The  high 
anode  voltage  ionizes  the  air  allowing  electrons  to  collect  on  the  anode  while  ions  are 
collected  on  the  cylinder.  As  the  coronatron  moves  across  the  thermoplastic  plate, 
some  of  the  newly  generated  ions  slip  through  the  cylinder’s  slot  (which  is  positioned 
next  to  the  plate)  and  charge  the  medium. 

The  second  step  is  to  expose  the  plate  with  an  object  and  reference  beam.  The 
plate’s  photoconductor  layer  allows  negative  charges  to  pass  from  the  transparent 
conductor  toward  the  thermoplastic.  It  should  be  noted  that  even  though  charge 
has  migrated,  a  uniform  electric  field  still  exists  across  the  plate. 

During  the  third  step,  the  plate  is  re-ionized  by  the  coronatron.  Since  the 
exposure  has  terminated,  the  second  charging  allows  more  charge  to  build  up  on  the 
exposed  areas  of  the  plate. 


Finally,  a  current  is  passed  through  the  transparent  electrode  of  the  plate  to 


create  heat  for  softening  the  thermoplastic.  While  the  thermoplastic  is  soft,  the 
plate’s  areas  that  were  previously  illuminated  will  be  compressed  by  the  increased 
electrostatic  field  produced  by  the  plate’s  second  charging.  The  plate  areas  that 
were  exposed  less  will  tend  to  bulge  out.  The  result  is  a  phase-relief  hologram  with 
a  diffraction  efficiency  up  to  10  percent. 

3.5.3  Experimental  Setup  for  Correlations  Figure  27  shows  the  optical  con¬ 
figuration  used  to  demonstrate  optical  correlations  of  the  Fourier  transform  In  r  —  0 
(FLRT)  feature  space.  The  first  step  was  to  generate  a  Vander  Lugt  filter  of  the 
FLRT  feature  space  of  a  target  template.  The  coherent  light  source  was  a  35  mil¬ 
liwatt  helium-neon  laser  (632.8  nm  wavelength).  The  laser’s  coherence  length  was 
measured  (using  the  Michaelson  interferometer  technique)  to  be  approximately  38 
cm.  The  reference  and  object  beam  pathlengths  were  adjusted  to  within  0.5  cm.  As 
recommended  in  Newport’s  HC-300  holographic  recording  device  operating  manual, 
the  object  beam  was  placed  at  an  angle  of  31  degrees  with  respect  to  the  reference 
beam,  and  the  object  beam  was  attenuated  with  neutral  density  filters  down  to  a 
1:10  power  ratio  with  respect  to  the  reference  beam.  The  target  and  test  image's 
FLRT  feature  spaces  were  previously  generated  and  recorded  on  35  mm  Polaroid 
Polagraph  high  contrast  black  and  white  slide  film  using  the  same  techniques  de¬ 
scribed  for  FTs  and  CTs.  Using  the  HC-300  in  automatic  mode,  a  thermoplastic 
plate  was  exposed  and  developed  as  a  matched  filter  for  the  target  template’s  FLRT 
feature  space.  The  diffraction  efficiency  of  the  filters  ranged  from  1-3  percent,  which 
is  lower  than  the  10  percent  value  published  by  Newport  Corporation  [15]. 

Next,  the  Vander  Lugt  filter  was  used  to  correlate  test  image  FLRT'  feature 
spaces  with  the  target’s  FLRT  feature  spaces.  Note  that  since  the  HC-300  holo¬ 
graphic  camera  also  acts  as  the  filter’s  holder  after  development,  the  filter  was  not 
moved  during  the  correlation  experiment.  To  generate  the  correlated  feature  spaces, 
the  feature  space  slide  used  to  create  the  matched  filter  was  replaced  with  a  feature 
space  slide  of  a  test  image.  The  reference  beam  was  blocked  from  the  matched  fil- 
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Figure  27.  Optical  setup  for  correlating  the  FLRT  feature  space. 


ter,  and  the  object  beam  illuminated  the  test  image  slide.  The  object  beam  travels 
through  the  transparency  and  a  Fourier  transforming  lens  to  the  matched  filter.  At 
the  matched  filter,  the  beam  is  diffracted  into  three  separate  beams.  The  beam  that 
contains  the  correlation  information,  located  31  degrees  away  from  the  object  beam’s 
path,  travels  along  the  extended  reference  beam  path.  The  correlation  information 
passes  through  a  Fourier  transform  lens  and  is  then  imaged  onto  a  CCD  camera 
where  the  correlation  is  viewed  and  digitally  recorded.  Figure  28(a)  shows  the  opti¬ 
cally  generated  autocorrelation  function  of  a  square.  In  Figure  28(b),  the  correlation 
function  of  (a)  has  been  digitally  profiled  into  the  3-D  surface  plot  shown. 

Since  correlation  peak  location  was  essential  for  determining  the  scale  and  ro¬ 
tation  of  a  target,  alignment  of  the  feature  space  transparencies  was  very  important. 
Figure  29  shows  that  the  optically  generated  feature  spaces  all  have  an  artifact  in 
common.  When  this  artifact  was  enlarged,  it  was  found  to  be  the  FT  of  the  image 
that  was  originally  incident  on  the  computer  generated  hologram.  Since  the  magni¬ 
tude  of  the  Fourier  transform  is  position  invariant,  this  artifact  was  located  at  the 
same  exact  position  of  every  FLRT  feature  space.  Therefore,  this  artifact  was  used  to 
ensure  that  each  test  feature  space  was  aligned  at  the  same  position  of  the  template 
feature  space  used  to  generate  the  matched  filter.  Note  that  in  Miazza’s  architecture 
(see  Figure  5),  image  alignment  is  not  a  factor  because  spatial  light  modulators  are 
used  instead  of  transparencies. 

3.6  Digital  Simulations 

To  ensure  the  validity  of  the  optical  generation  and  correlation  of  the  FLUT 
feature  space,  the  optic  al  coordinate  transforms  were  simulated  and  correlated  digi¬ 
tally.  A  digital  model  of  a  In  r  —  0,  phase-only  filter  simulated  the  optical  coordinate 
transform.  Note  that  the  optical  architecture  used  in  this  thesis  used  a  computer 
generated  hologram  (CG11).  The  CGH  is  actually  a  binarized  interferogram  of  the 
phase-only  filter’s  output  wavefront  interfered  with  a  plane  wave  reference  beam. 
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Figure  29.  CGH  artifact  where  arrow  points  to  artifact  used  to  align  the  feature 
space  for  correlating. 

The  CGH  produces  a  coordinate  transform  at  the  first  diffracted  order  of  its  output: 
whereas,  the  phase-only  filter’s  transform  is  located  on  the  optical  axis. 

“Hologram_menu”,  a  software  package  written  by  Marvin  Hill,  was  adapted  to 
digitally  simulate  optical  In  r—0  coordinate  transformations.  Appendix  E  presents 
pertinent  portions  of  Hologramjnenu’s  code  used  in  this  thesis.  Initially,  computer 
generated  Fourier  transforms  of  the  256  by  256  pixel  test  images  were  used  with 
Hologram_inenu  to  generate  the  FLRT  feature  space.  However,  Hologram jneuu's 
fast  Fourier  transform  routine  created  transforms  that  were  too  small  for  the  digital 
phase-only  filter  to  generate  a  recognizable  coordinate  transform.  Consequently, 
optical  Fourier  transformed  images  generated  by  the  procedures  described  earlier 
in  this  chapter  were  recorded  digitally  using  a  framegrabber  (see  Appendix  D)  and 


then  used  as  the  input  for  Hologram_menu.  A  FLRT  feature  space,  generated  by 
Hologram_menu,  of  a  small  square  rotated  by  45  degrees  is  shown  in  Figure  30. 
Note  that  this  feature  space  has  an  additional  coordinate  transform  of  a  square 
draped  across  the  top  of  the  expected  result.  This  additional  energy  was  caused 
by  the  limiting  aperture  of  the  input  image  and  the  digital  phase-only  filter.  The 
optically  generated  Fourier  transformed  images  contained  some  energy  at  the  edges 
of  the  256  by  256  pixel  (square  image)  array;  and  therefore,  this  energy  was  also 
mapped  by  the  phase  function.  This  additional  intensity  pattern  was  observed  in 
all  digitally  coordinate  transformed  Fourier  transforms.  This  redundant  intensity 
pattern  posed  additional  problems  in  that  digital  correlations  of  feature  spaces  were 
dominated  by  the  pattern’s  autocorrelation  peak.  To  allieviate  this  problem,  the 
optically  generated  Fourier  transforms  were  multiplied  with  a  Hamming  function 

w(n)  =  0.54  —0.46  cos  ,  0  <  n  <  N  —  1  (10) 

The  result  of  preprocessing  the  optical  Fourier  transform  with  the  Hamming  function 
is  shown  in  Figure  30  (b). 

Kobel  and  Martin’s  computer  program  “Executive”,  was  used  to  generate'  linear 
correlations  of  the  feature  spaces  created  with  Hologram_menu.  A  crosscorrelatiou 
of  the  FLRT  feature  spaces  of  a  large  square  with  a  square  scaled  by  a  half  and 
rotated  by  45  degrees  is  shown  in  Figure  31. 

3.7  Summary 

In  this  chapter,  optical  implementation  and  digital  simulation  of  portions  of 
the  Kobel-Martin-Horev  algorithm  were  discussed.  Section  3.3  detailed  the  Fourier 
transforming  property  of  lenses  and  explained  how  a  dc-block  was  used  to  increase 
resolution  in  the  optical  system.  This  section  also  related  two  techniques  that  pro¬ 
vided  precise  alignment  of  the  dc-block  and  focusing  of  the  CCD  camera. 
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Section  3.4  discussed  the  use  of  a  CGH  to  perform  a  In  r  —  9  CT.  Several 
important  improvements  over  previous  attempts  at  optical  implementation  are  de¬ 
tailed.  These  include  imaging  onto  the  CGH  instead  of  butting  the  FT  up  against 
the  CGH,  magnifying  the  FT  to  overcome  a  SBWP  problem,  and  using  the  CT  of 
a  square  to  ensure  system  alignment  and  focus.  Additionally,  linearity  of  the  CGH 
CT  was  examined  and  shown. 

Vander  Lugt  filtering  and  thermoplastic  holography  were  discussed  in  Section 
3.5.  Optical  correlation  procedures  were  detailed  and  examples  given.  It  was  noted 
that  an  artifact  from  the  CGH  CT  process  could  be  used  in  aligning  the  matched- 
filter  for  correlations. 

Finally,  Section  3.6  briefly  discussed  digital  simulations  used  to  verify  optical 
results.  Examples  of  digitally  produced  CTs  and  correlations  were  provided.  Chap¬ 
ter  IV  follows  with  results  based  on  experimentation  using  the  optical  and  digital 
methods  just  described. 
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IV.  Experimental  Results 


4-1  Introduction 

This  chapter  will  show  and  discuss  results  of  the  experiments  performed  to 
optically  implement  portions  of  the  Miazza  design.  In  addition,  the  optical  results 
will  be  compared  to  results  gained  from  digitally  implementing  those  same  portions 
of  Miazza’s  design.  In  the  next  section  of  this  chapter,  results  obtained  from  optical 
coordinate  transforming  will  be  examined.  Following  this  will  be  a  section  detailing 
the  results  obtained  by  optically  correlating  the  feature  spaces  of  various  input  im¬ 
ages.  Both  autocorrelations  and  crosscorrelations  will  be  analyzed.  Finally,  a  short 
summary  of  the  results  will  be  given. 

4-2  Optical  FLRT  Feature  Space  Results 

The  Fourier  transform  In  r  —  0  (FLRT)  feature  space  was  generated  using 
the  techniques  detailed  in  sections  2.4  and  3.4.  Figures  32-42  present  photographs 
of  several  input  scenes,  their  optically  generated  Fourier  transforms  (intensity  distri¬ 
butions),  and  their  optically  generated  FLRT  feature  spaces  (coordinate  transforms 
(Cl's)),  which  were  generated  using  the  computer  generated  hologram  (CG11).  Also 
included  in  each  figure  is  the  respective  computer  generated  FLRT  feature  space, 
which  was  generated  using  a  digital  mode!  of  an  optical,  phase-only  filter  to  perform 
the  coordinate  transformation. 

Figures  32-34  present  the  development  of  the  FLRT  feature  space  for  a  square, 
a  square  scaled  by  one-half,  arid  the  scaled  square  rotated  by  45"  .  Notice  that  the 
diffracted  orders  of  the  Fourier  transform  of  the  smaller  square  were  resolvable  in 
the  photograph,  compared  to  the  diffracted  orders  of  t  he  transformed  larger  square 
which  were  not  resolvable.  Actually,  resolvability  was  more  a  function  of  the  device 
or  medium  recording  the  image.  The  high  contrast,  black  and  white  slide'  film  used 
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to  record  the  image  intensity  distributions  (which  removed  the  phase  information 
from  the  image  before  further  optical  processing)  demonstrated  an  ability  to  resolve 
the  diffracted  orders  of  the  Fourier  transformed,  large  square. 

In  each  case  investigated,  the  optical  FLRT  feature  spaces  were  very  similar 
in  appearance  to  the  computer  generated  FLRT  feature  spaces.  One  difference, 
however,  was  the  optical  artifact  (ultimately  used  as  a  reference  point  to  align  FLRT 
feature  spaces  for  correlation)  generated  by  the  CGH  that  was  not  present  in  the 
digitally  generated,  op4  ical  feature  spaces  (see  Figure  29  in  Chapter  3).  The  absence 
of  this  artifact  in  the  digital  modeling  supports  the  idea  that  the  artifact  is  due  to  the 
binary-periodic  nature  of  the  CGH,  which  is  the  binarized  interference  pattern  of  the 
phase-only  filter  with  a  plane  wave.  The  binary  nature  of  the  CGII  allows  some  of  t  he 
light  to  pass  through  without  being  diffracted.  The  light  from  an  image  that  passes 
through  unhindered  is  a  sampled  version  of  the  original  image  T(a,  fl)  incident  on 
the  hologram.  The  sampling  can  be  modeled  as  the  multiplication  of  an  image  with 
a  two-dimensional  rectangle  function  (rect)  convolved  with  a  two-dimensional  array 
of  delta  functions  commonly  called  a  comb  function. 

Ta(aJ. 3)  =  T{a,  (3)[comb(a,  fi)  *  rect(aj3)\  (20) 

The  sampled  image  is  then  transformed  by  the  Fourier  transforming  lens  to  yield: 

tsifaJe)  =  t{faJp)  *  ( sinc{fa,f0)comb{fa,fl3 )]  (21) 

The  final  result  is  the  Fourier  transform  of  the  image  incident  on  the  CGH  convolved 
with  a  sampled  sine  function.  Consequently,  the  resulting  pattern  is  a  periodic 
repetition  of  the  Fourier  transformed  image  modulated  by  a  sine  pattern.  For  this 
research,  T(a,ft)  was  the  magnitude  squared  of  the  Fourier  transform  of  the  original 
input  L,{x,y )  .  Therefore,  the  artifact,  being  the  Fourier  transform  of  T(a,  ;t)  ,  can 
be  mathematically  expressed  as 

Ufa,  Iff)  =  F  [7i(«,  0)T*  (a,  3)} 

=  C(*,2/)  *t'  (x,y)  (22) 
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Figure  32 

Development  of  the  FLRT  CT  of  a  square.  Photograph  (a)  is  t  he  input 
template  (top  left),  the  optical  FT  of  the  template  (top  right),  opt  ically 
generated  FLRT  CT  (bottom).  Photograph  (b)  is  the  digitally  modeled 
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optical  FLRT  CT. 
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Figure  33.  Development  of  the  FLRT  CT  of  a  square  scaled  by  a  half.  Photograph 
(a)  is  the  input  template  (top  left),  the  optical  FT  of  the  template 
(top  right),  optically  generated  FLRT  CT  (bottom).  Photograph  (b)  is 
the  digitally  modeled  optical  FLRT  CT.  Note  the  similarities  between 
Figure  32  and  the  above  FLRT  CT  even  though  their  exists  a  scale 
difference  of  two  between  the  corresponding  FT  templates. 
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Figure  34.  Development  of  the  FLRT  CT  of  a  square  sealed  by  a  half  and  rotated 
by  45°  .  Photograph  (a)  is  the  input  template  (top  left),  the  optical  FT 
of  the  template  (top  right),  optically  generated  FLRT  CT  (bottom). 
Photograph  (b)  is  the  digitally  modeled  optical  FLRT  C'T.  Note:  A 
rotation  in  the  template  (see  Figure  33)  yields  a  linear  shift  in  the  /,? 
(horizontal)  axis. 
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which  is  the  autocorrelation  of  the  original  input  transparency. 

A  similarity  between  the  optical  coordinate  transforms  and  the  digitally  mod¬ 
eled  transforms  is  an  apparent  non-linearity  in  mapping  (or  unwrapping)  of  energy 
along  the  fg  (horizontal)  axis.  In  Figure  32(a),  the  legs  of  the  FT  of  the  square  were 
mapped  into  three  full  legs  and  two  half  legs  along  the  fg  axis  of  the  FLRT  feature 
space.  The  energy  of  the  center  leg  and  the  two  half  legs,  which  correspond  to  the 
uppermost  and  lowermost  legs  of  the  FT,  is  more  dispersed  than  the  two  outside  (left 
and  right  of  center)  legs.  However,  in  Figure  34,  where  the  square  is  rotated  by  45°  , 
the  energy  in  the  FLRT  feature  space  appears  to  be  mapped  uniformly  across  the 
fg  axis.  These  results  indicate  that  energy  is  not  linearly  mapped  by  the  CGH  along 
the  fg  axis.  The  results  also  indicate  that  energy  incident  on  the  left  and  right  sides 
of  the  CGII  (  0  «  90°  or  270°  )  is  mapped  tighter  than  energy  incident  on  the  top 
and  bottom  portions  of  the  CGH.  Furthermore,  energy  incident  at  the  four  corners 
of  the  CGH  (  0  as  45°,  135°,  225°  and  315°  )  appears  to  be  evenly  mapped.  Note, 
however,  that  the  legs  of  the  optical  CT  shown  in  Figure  34  have  beveled  tops.  This 
result  occurs  because  the  FT  had  to  be  magnified  to  overcome  the  SBWP  problem 
discussed  in  section  3.4.4.  When  the  Fourier  transform  was  magnified,  the  spatial 
extent  of  its  legs  was  such  that  some  of  the  higher  diffracted  orders  were  cut  olT  by 
the  edges  of  the  CGH.  Therefore,  that  lost  energy  was  not  coordinate  transformed. 
In  the  particular  case  of  the  rotated  square,  the  four  legs  of  the  FT  were  each  cutoff 
at  a  45°  angle,  and  thus,  the  legs  of  the  CT  are  beveled  at  45°. 

Note  also,  that  a  rotation  in  the  input  generates  a  cyclic  shift  along  the  J'g 
axis  of  the  FLRT  feature  space.  In  Figure  32,  the  distance  between  the  logs  of  the 
FT  along  the  fg  axis  of  the  FLRT  feature  space  is  90°.  The  FLRT  feature  space 
in  Figure  34  shows  that  a  rotation  in  the  input  by  45°  causes  a  cyclic  shift  of  the 
feature  space  by  a  distance  equivalent  to  half  the  distance  between  adjacent  legs  in 
the  feature  space,  or  45°  .  Therefore,  the  half  legs  observed  in  Figure  32  eycliely 
shift  and  combine  to  yield  four  whole  legs  in  the  rotated  square's  feature  space. 
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Figures  35-42  are  FLRT  feature  spaces  of  military  vehicles  and  aircraft.  Figures 
35  and  36  present  the  development  of  the  FLRT  feature  spaces  of  a  tank  and  a  tank 
scaled  by  three-fourths.  The  input  template  of  the  tank  is  very  similar  in  geometry 
to  a  rectangle  and  an  ellipse.  Consequently,  the  intensity  distribution  of  the  FT 
of  a  tank  can  be  analyzed  based  on  similarities  to  these  simple  geometric  shapes. 
As  shown  in  Figure  35,  the  FT  of  a  tank  produces  a  prominent  vertical  narrow 
band  of  energy  caused  by  its  narrow  height  compared  to  its  length.  Also,  there  is 
a  small  amount  of  energy  due  to  the  tank’s  narrow  barrel  being  transformed  at  an 
angle  to  the  vertical  band,  but  it  is  difficult  to  discern  from  the  photograph.  The 
tank’s  elliptical  shape  generates  FT  energy  in  a  scattered  elliptical,  quasi-Airy  disk 
pattern.  Consequently,  the  FLRT  feature  space  of  a  tank  has  one  full  leg  in  the 
middle  corresponding  to  the  energy  at  the  top  of  the  FT  and  two  partial  legs,  which 
when  combined,  correspond  to  the  vertical  band  of  energy  at  the  bottom  of  the  FT. 
Since  all  of  the  vehicle  templates  tested  had  a  general  rectangular  shape,  their  FTs 
and  corresponding  FLRT  feature  spaces  were  all  similar  in  appearance,  with  eacli 
feature  space  having  two  prominent  legs  of  energy.  Figures  39-12  present  the  FLRT 
feature  spaces  of  input  scenes  with  multiple  image  types  present. 

4-3  Correlation  Results 

The  main  thrust  of  this  research  was  to  show  and  verify  the  predictability 
of  optically  correlating  a  position,  scale,  and  rotation  invariant  feature  space  using 
Vander  Lugt  matched-filters.  The  predictable  characteristic  of  the  correlation  was 
peak  location.  A  specific  horizontal  shift  would  indicate  a  corresponding  change 
in  the  input  image’s  relative  rotation,  and  a  specific  vertical  shift  would  indicate  a 
corresponding  change  in  the  input  image’s  relative  scale. 

Figures  43-48  present  examples  of  optical  cori  Nations  of  optically  generated 
FLRT  feature  spaces  and  digital  correlations  of  computer-modeled,  optical  FLR'l 
feature  spaces  (presented  in  the  previous  section),  as  well  as,  three-dimensional  plots 
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Figure  35.  Development  of  the  FLRT  CT  of  a  tank.  Photograph  (a)  is  the  input 
template  (top  left),  the  optical  FT  of  the  template  (top  right),  optically 
generated  FLRT  CT  (bottom).  Photograph  (b)  is  the  digitally  modeled 
optical  FLRT  CT. 
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Figure  36.  Development  of  the  FLRT  CT  of  a  tank  scaled  by  a  half.  Photograph 
(a)  is  the  input  template  (top  left),  the  optical  FT  of  the  template 
(top  right),  optically  generated  FLRT  CT  (bottom).  Photograph  (b)  is 
the  digitally  modeled  optical  FLRT  CT.  Note  the  similarities  between 
Figure  35  and  the  above  FLRT  CT  even  though  there  exists  a  3/1  scale 
difference  between  the  corresponding  FT  templates. 
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(b) 

Figure  37.  FLRT  CT  of  a  truck.  Photograph  (a)  is  the  input  template  (top  left ).  the 
optically  generated  FLRT  CT  template  (top  right),  optically  generated 
FLRT  CT  of  truck  rotated  by  45°  (bottom).  Photograph  (b)  is  the 
digitally  modeled  optical  FLRT  CT.  Note:  Multiple  object  scenes  were 
tested  for  target  discrimination  performance. 


67 


Figure  38.  Development  of  the  FLRT  CT  of  an  F - 1 5 .  Photograph  (a)  is  the  input. 

template  (top  left),  the  optical  FT  of  the  template  (top  right),  optically 
generated  FLRT  CT  (bottom).  Photograph  (b)  is  the  digitally  modeled 
optical  FLRT  CT. 
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Figure  39.  Development  of  the  FLRT  CT  of  a  truck  and  tank  rotated  DO*'  with 
respect  to  each  other.  Photograph  (a)  is  the  input  template  (top  left), 
the  optical  FT  of  the  template  (top  right),  optically  generated  FLRT 
CT  (bottom).  Photograph  (b)  is  the  digitally  modeled  optical  FLRT 
CT. 
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Figure  '10.  Development  of  the  FLRT  CT  of  two  tanks  and  two  truc  ks.  Photograph 
(a)  is  the  input  template  (top  left),  the  optical  FT  of  the'  template*  (top 
right),  optically  generated  FLRT  CT  (bottom).  Photograph  ( l> )  is  tlu* 
digitally  modeled  optical  FLRT  CT. 
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(b) 


Figure  1 1 . 


Development  of  the  FLRT  C’T  of  a  tank,  truck,  and  F-lo.  Photograph 
(a)  is  the  input  template  (top  left),  the  optical  FT  of  the  template  (top 
right),  optically  generated  FLRT  CT  (bottom).  Photograph  (b)  is  the 
digitally  modeled  optical  FLRT  CT. 
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Figure  12.  Development  of  the  FLRT  CT  of  three  F-l-Vs.  Photograph  (a)  is  the 
input  template  (top  left),  the  optical  FT  of  the  template  (top  right.), 
optically  generated  FLRT  CT  (bottom).  Photograph  ( 1> )  is  the  digitally 
modeled  optical  FLRT  CT. 


of  each  correlation.  The  optical  correlations  were  performed  using  the  Vander  Lugt 
Filtering  technique  outlined  in  section  3.5.  The  computer- modeled,  optical  FLRT 
feature  spaces  were  correlated  using  a  linear  correlation  routine  written  by  Kobel 
and  Martin.  Displayed  in  the  lower-left  and  lower-right  corners  of  the  correlation 
intensity  distribution  photographs  are  the  templates  used  to  generate  the  feature 
spaces  that  produced  the  correlation.  On  the  left  is  the  template  used  to  generate 
the  Vander  Lugt  filter,  and  on  the  right  is  the  input  template. 

Optical  correlations  were  attenuated  using  neutral  density  filters  to  avoid  sat¬ 
urating  the  CCD  camera.  The  attenuation  was  held  constant  throughout  the  exper¬ 
iment  so  the  relative  intensities  of  cross-correlation  and  autocorrelat  ion  peaks  could 
be  compared.  However,  the  digital  correlation  data  was  normalized  independently 
for  each  correlation  and  compressed  into  256  discrete  levels.  Consequently,  the  digi¬ 
tal  data  could  be  evaluated  only  on  the  merits  of  peak  shift  and  shape,  and  not  on 
relative  height  (intensity). 

The  3-dimensional  intensity  plots  of  the  correlations  were  created  by  digitally 
reducing  the  correlation  images  down  to  128  by  128  pixel  images.  These  smaller  im¬ 
ages  were  then  plotted  using  a  laser  printer  and  a  plotting  routine  named  SURFKR. 

4-3.1  Target  Rotations  Figures  43-45  demonstrate  the  classic  linear  transla¬ 
tion  of  a  correlation  peak  along  the  f$  axis  due  to  an  in-plane  rotation  of  a  square 
by  45°  .  Relative  rotational  shifts  were  determined  by  measuring  the  the  distance 
the  correlation  peak  translated  along  the  fg  axis  with  respect  to  the  autocorrelation 
peak  obtained  by  correlating  the  template  C’T  used  to  make  the  matched-filter  with 
the  matched-filter.  The  fg  axis  shift  was  calibrated  based  on  the  relative  distances 
between  peaks  where  the  relative  angles  are  known.  Since  the  Fourier  transform 
of  a  square  has  “legs”  at  equal  increments  of  90°  apart,  both  the  FLRT  feature 
spare  and  the  autocorrelation  plane4  have  periodic  peaks  90f'  apart.  This  particular 


Figure  43. 


Optical  autocorrelation  of  the  FLR.T  CT  of  a  square  where  (a)  is  a  photograph  of 
the  intensity  distribution,  and  (b)  is  a  3-D  plot.  Note:  Correlations  were  attenuated 
to  avoid  saturating  CCD  camera. 
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Figure  44.  Digital  autocorrelation  of  the  computer-modeled  optical  FLRT  C  I  ol  a  square  where 
(a)  is  a  photograph  of  the  intensity  distribution,  and  (b)  is  a  3-D  plot.  .Vote:  Digital 
correlation  data  was  normalized  to  256  intensity  levels. 
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Figure  45.  Optical  correlation  of  the  FLHT  CTs  of  a  square  and  a  square  scaled  by  n  I 
rotated  45°  where  (a)  is  a  photograph  of  the  intensity  distribution,  and  (l>) 
plot.  Note  correlation  peak  shifted  46.1°  compared  to  the  autocorrelation 
Figure  43. 


alf  and 
is  a  3-0 
peak  in 
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Figure  'lfi.  Digital  correlation  of  the  computer-modeled  optical  FLRT  CTs  of  a  square  and  a 
square  scaled  by  a  half  and  rotated  -15°  where  (a)  is  a  photograph  of  the  intensity 
distribution,  and  (b)  is  a  3-D  plot.  Note  correlation  peak  shifted  compared 

to  46.1°  optically  (see  Figure  '15). 
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Figure  47.  Optical  correlation  of  the  FLRT  CTs  of  a  truck  and  of  a  truck  and  tank  rotated  1")° 
in  opposite  directions  where  (a)  is  a  photograph  of  the  intensity  distribution,  and 
(b)  is  a  3-D  plot.  Note:  Due  to  cyclic  nature  of  0  axis,  the  pseudo-autocorrelation 
peak  is  located  at  —42.7°  versus  the  highest  peak  located  at.  129.5". 
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Figure  48. 


Digital  correlation  of  the  computer-modeled  optical  FLRT  CTs  of  a  truck  and  a 
truck  and  tank  rotated  45°  in  opposite  directions  where  (a)  is  a  photograph  of 
the  intensity  distribution,  and  (b)  is  a  3-D  plot.  Similar  to  Figure  17,  tin'  pseudo¬ 
autocorrelation  peak  (second  highest)  is  located  at  —51.6°  versus  highest  peak  at 
137°. 
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autocorrelation  (see  Figure  43)  was  used  to  calibrate  the  f$  axis.  The  correlation 
between  the  CT  of  a  square  rotated  45°  and  the  matched-filter  of  an  unrotated 
square  yielded  a  relative  rotational  shift  of  46.1°  when  performed  optically  and 
46.9°  when  performed  digitally. 

Figures  47  and  48  reveal  a  weakness  in  optically  correlating  the  FLRT  feature 
space.  These  figures  show  the  correlation  plane  of  a  truck  CT  correlated  with  the 
CT  of  a  truck  and  a  tank  rotated  45°  in  opposite  directions.  Similar  to  the  case  of 
the  rotated  square  presented  above,  one  might  expect  the  rotated  truck  correlation 
plane  to  have  its  largest  peak  located  —45°  away  from  the  truck’s  autocorrelation 
peak.  With  calculations  based  on  the  largest  peak,  the  correlation  plane  indicates  a 
shift  of  129.5°  when  generated  optically  and  137.0°  when  generated  digitally.  The 
peak  that  actually  corresponded  to  the  —45°  shift  was  the  third  highest  in  intensity 
and  indicated  a  rotation  of  -42.7°  when  performed  optically  and  -51.6°  when 
performed  digitally. 

These  false  indications  exist  because  rotations  in  the  input  space  yield  cyclic 
shifts  in  FLRT  space,  whereas  optical  correlations  are  linear.  Consequently,  corre¬ 
lations  performed  in  the  FLRT  space  must  be  cyclic  along  the  fg  axis  to  ensure 
that  the  feature  spaces  can  completely  match  to  yield  a  true  autocorrelation  peak. 
Figure  49  graphically  illustrates  the  problem  of  lineaily  correlating  a  cyclic  feature 
space.  Part  (a)  illustrates  a  generic  CT  as  seen  in  the  current  2ir  space,  while  (b) 
shows  how  the  CT  of  part  (a)  would  look  in  a  47T  space.  Then,  in  part  (c)  tin.' 
result  of  autocorrelating  the  CT  of  (a)  is  shown.  It  is  this  true  autocorrelation  peak 
(as  shown  by  the  arrow  in  (c))  that  is  the  desired  result  when  correlating  position, 
scale,  and  rotation  invariant  feature  spaces.  With  respect  to  rotation,  part  (d)  shows 
how  the  generic  CT  would  change  when  the  input  image  is  rotated  by  15°.  Now. 
when  the  rotated  version,  (d),  is  correlated  with  the  matched-filter,  (a),  the  resultant, 
peak  is  no  longer  a  true  autocorrelation.  This  is  illustrated  in  part  (e)  of  Figure  19. 
where  linear  correlation  of  a  cyclic  feature  space  is  shown  to  produce  an  asymmetri- 
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Figure  49.  Graphic  illustration  of  ‘lir  versus  2n  optical  correlation  where  (a)  is  an  example  of  2ir  space  ami 
(b)  is  an  expanded  4*  space  of  (a).  Graph  (c)  is  autocorrelation  of  (a).  Graph  (d)  is  (a)  shifted 
clockwise  45°  .  Graph  (e)  is  correlation  of  (a)  with  (d).  Graph  (f)  is  correlation  of  (cl)  with  \ rr 
spar*  (b). 


cal,  pseudo-autocorrelation  peak.  Asymmetry  occurs  since  the  feature  space  wraps 
around  when  the  input  image  is  rotated,  resulting  in  a  condition  where  the  rotated 
feature  space  may  not  perfectly  overlap  the  matched-filter  during  the  linear  correla¬ 
tion.  Casasent  proposed  that  this  phenomena  could  be  eliminated  by  correlating  the 
2tt  input  feature  space  with  an  expanded  47r  matched-filter  feature  space  [1:940]. 
Figure  49(f)  illustrates  that  correlating  with  an  expanded  feature  space  produces  a 
symmetrical,  autocorrelation  peak. 

4-3.2  Target  Scale  Measurable  translations  of  correlation  peaks  along  the 
radial  axis  were  observed  only  for  a  square  scaled  by  half  (see  Figures  43  and  45). 
In  this  case,  the  correlation  peak  shifted  20  pixels  (in  a  vertical  field  of  400  pixels) 
along  the  radial  axis.  According  to  data  presented  in  section  3.4.5  on  the  logarithmic 
response  of  the  CGH  (see  Figure  21),  a  scale  change  of  two  on  the  nonlinear  portion 
of  the  graph  will  shift  the  output  feature  space  by  approximately  23mm  in  an  field 
of  250mm  (the  size  of  the  display  used  in  this  research).  On  the  400  pixel  vertical 
field  of  the  CCD  camera,  23mm  would  correspond  to  a  shift  of  36.8  pixels.  However, 
in  the  optical  correlation  setup  used  in  this  study  (see  Figure  27),  the  sizes  of  the 
input  and  output  Fourier  transforming  lenses  (lenses  1  and  2)  were  chosen  to  scale’ 
down  the  output  correlation  by  a  factor  of  two.  Therefore,  the  theoretical  shift  of 
the  correlation  peak  was  18.4  pixels,  which  compares  quite  favorably  to  the  20-pixel 
shift  observed. 

But  why  were  correlation  peak  translations  due  to  scale  changes  only  observed 
for  templates  of  squares?  Due  to  the  nonlinear  response  of  the  CGI1  on  the  radial 
axis,  the  scale  change  of  a  small  input  image  incident  on  the  CGH  exhibits  a  greater 
shift  in  the  feature  space  than  an  equivalent  scale  change  of  a  much  larger  image.  The 
CGH  exhibited  a  greater  shift  in  the  feature  space  when  operating  in  the  nonlinear 
portion  of  the  logarithmic  curve  shown  in  Figure  21  (b).  The  square  templates  used 
during  the  research  were  much  larger  than  the  tank,  truck,  and  aircraft  templates. 
Consequently,  the  Fourier  transform  spaces  of  the  squares  were  notably  smaller  than 
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the  transform  spaces  of  the  other  templates.  The  larger  Fourier  transform  spaces 
were  coordinate  transformed  on  the  linear  portion  of  the  CGH’s  logarithmic  curve, 
where  changes  in  scale  result  in  very  small  translations  of  the  output. 

Ability  to  measure  scale  translations  in  the  FLRT  feature  space  is  also  depen¬ 
dent  on  the  resolution  of  the  of  the  detector  used  in  the  correlation  plane.  For  this 
research,  a  CCD  camera  with  a  digital  framegrabber  was  used  as  the  correlation 
plane  detector.  The  CCD  camera  has  a  vertical  range  of  400  pixels.  In  the  case  of 
the  scaled  square  example,  the  peak  only  translated  5  percent  of  the  vertical  range 
of  the  camera.  A  cylindrical  lens  could  be  used  to  expand  the  correlation  output  in 
the  radial  axis  to  increase  the  detector’s  ability  to  resolve  the  smaller  shifts  along 
the  radial  axis. 

4.3.3  Target  Detection  While  the  previous  sections  discussed  determining 
scale  and  rotation  changes  of  input  images  based  on  movement  of  the  autocorre¬ 
lation  peak,  this  presupposes  that  the  FLRT  feature  space  will  provide  correlation 
planes  in  which  auto-  and  cross-correlations  can  be  distinguished  from  one  another. 
In  other  words,  if  the  input  scene  contains  multiple  images  (or  other  sources  of  back¬ 
ground  noise),  will  the  FLRT  space  correlations  provide  adequate  resolution  to  allow 
determination  of  target  presence?  Figures  50-55  present  results  of  this  research  that 
provide  some  insight  into  answering  the  above  question.  In  Figures  50-52,  the  pho¬ 
tographs  in  (a)  and  the  plots  in  (b)  represent  correlation  planes  obtained  optically 
where  the  input  scene  contained  the  target  template,  as  well  as,  additional  images. 

Ideally,  one  would  expect  the  most  intense  peak  to  be  a  symmetric,  autocorre¬ 
lation  peak  positioned  appropriately  based  on  any  scale  or  rotation  changes.  \V  hilo. 
Figures  50  and  51  do  show  the  highest  intensity  peaks  at  the  expected,  zero-shift  lo¬ 
cation,  Figure  52  has  its  highest  peak  located  approximately  180''  from  the  expected 
location.  It  should  be  noted  that  the  apparent  lack  of  crosscorrelation  components 
in  Figure  51  is  due  to  an  increased  amount  of  neutral  density  filtering  used  when 
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(b) 


Figure  50.  Optical  correlation  of  the  CTs  of  a  truck  and  multiple  vehicles  where  (a 
intensity  distribution  and  (b)  is  the  3-D  plot.  Note:  No  shift  in  0  axis  ii 
0°  rotation.  Peak  shifted  5  pixels  in  radial  axis  compared  to  theoretical  (5. 
(based  on  linear  portion  of  CGH  In  curve). 


)  is  the 
ldieates 
8  pixels 
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Figure  51.  Optical  correlation  of  the  CTs  of  an  F-15  and  three  F-15s  where  (a)  is  the  intensity 
distribution  and  (b)  is  the  3-D  plot.  Peak  corresponds  to  autocorrelation  of  two 
large  F-15’s  (see  Figure  5(3). 


Igurc  52.  Optical  correlation  of  the  CTs  of  a  square  and  a  square,  triangle,  and  circle  where  (a  ) 
is  the  intensity  distribution  and  (b)  is  the  3-D  plot.  Note  the  large  crosscorrelation 
peaks  surrounding  the  autocorrelation  peak  at  pixel  location  (3(5. 3(5). 
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storing  the  output  to  disk. 

In  Figures  53-55,  the  correlations  were  of  scenes  that  did  not  contain  the 
target  template.  For  these  cases,  high-intensity,  autocorrelation-type  peaks  would 
not  be  expected.  However,  in  Figure  53,  the  optical  correlation  of  the  CTs  of  a 
truck  and  an  F-15  produces  a  very  symmetric- looking,  correlation  peak.  Figures  54 
and  55,  however,  do  show  the  more  ragged  nature  that  one  would  expect  from  a 
crosscorrelation. 

In  contrast  to  Figures  50-55,  Figures  56  and  57  show  optical  correlations  where 
the  only  input  image  was  the  template  or  the  scaled  version  of  the  template.  In 
comparing  these  eight  figures,  the  following  statements  can  be  made: 

1.  Sometimes  the  autocorrelation  peak  is  visually  evident  when  the  target  is  in 
the  input  scene.  However,  this  is  not  always  true. 

2.  Autocorrelation-type  peaks  are  visually  evident  in  some  cases  where  the  target 
is  not  in  the  input  scene. 

3.  Scaled  and  rotated  versions  of  targets  provide  very  good  autocorrelation  results 
when  they  are  the  only  images  in  the  input  scene. 

These  findings  lead  the  researchers  to  believe  that  an  intelligent  peak-finder 
(such  as  a  neural  network)  must  be  incorporated  to  distinguish  the  optically  gener¬ 
ated  autocorrelation  peaks  from  crosscorrelation  peaks.  Additionally,  it  is  believed 
that  going  to  the  4 7r  space  as  opposed  to  the  2 tt  space  will  remove  much  of  the 
ambiguity  from  the  identification  process  and  provide  much  better  resolution  to  the 
process  as  a  whole. 
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Figure  53.  Optical  correlation  of  the  CTs  of  a  truck  and  an  F-15  where  (a)  is  the  intensity 
distribution  and  (b)  is  the  3-D  plot.  Note:  Crosscorrelation  peak  could  easily  be 
confused  as  an  autocorrelation. 
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Figure  54. 


Optical  correlation  of  the  CTs  of  an  F-15  and  a  tank  where  (a)  is  the  intensity  dis¬ 
tribution  and  (b)  is  the  3-D  plot.  Note  the  wide  distribution  of  energy  characteristic 
of  a  crosscorrelation. 
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Figure  55.  Optical  correlation  of  the  CTs  of  a  square  and  a  tank  where  (a)  is  the  intensity  dis¬ 
tribution  and  (b)  is  the  3-D  plot.  Note  the  wide  distribution  of  energy  characteristic 
of  a  crosscorrelation. 
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Figure  56.  Optical  autocorrelation  of  the  CT  of  an  F-15  where  (a)  is  the  intensity  distribution 
and  (b)  is  the  3-D  plot.  Note  the  single,  intense  correlation  peak.  Its  position  is  a 
reference  for  unity  scale  and  0°  rotation. 
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Intensity  (0-255) 


4-4  Summary 

The  results  detailed  in  this  chapter  show  that  the  finr  —  fe  feature  space  can  be 
a  viable  coordinate  transform  to  be  used  in  a  position,  scale,  and  rotation  invariant 
pattern  recognizer.  Shifts  in  the  position  of  autocorrelation  peaks  were  shown  to  be 

predictable,  which  allows  scale  and  rotation  changes  to  be  determined. 

% 

Various  anomalies  present  in  the  current  system  were  identified  and  discussed. 
Among  these  are  the  distortion  of  the  CT  caused  by  the  CGH,  the  4ir  versus  2 tt 
feature  space,  and  the  artifact  produced  by  the  sampling  nature  of  the  CGH.  The 
following  chapter  discusses  conclusions  derived  from  the  results  and  lists  several 
suggested  topics  for  further  research. 
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V.  Conclusions  and  Recommendations 


5. 1  Introduction 

As  stated  in  section  1.3,  the  scope  of  this  thesis  focused  on  demonstrating  and 
evaluating  portions  of  Miazza’s  optical  design.  In  the  next  section  of  this  chapter, 
conclusions  are  drawn,  based  on  the  results  of  this  thesis  effort.  The  last  section  of 
this  chapter  provides  a  myriad  of  recommended  areas  of  study  which  could  follow 
from  this  current  research. 

5.2  Conclusions 

The  goal  of  this  research  was  to  investigate  the  optical  implementation  of 
correlation  techniques  in  the  magnitude-squared  Fourier  transform  In  r  —  0  (FLRT) 
feature  space.  To  accomplish  this  goal,  a  technique  for  recording  the  original  input 
templates  using  rubylith  templates  was  developed.  In  this  technique,  high  resolution 
glass  photographic  plates  were  used  to  record  the  negative  of  the  rubylith  template. 
This  process  produced  templates  with  vastly  improved  transmission  efficiencies  (in 
comparison  to  previously  used  Polaroid  slides)  and  limited  unwanted  diffraction. 

The  next  step  required  was  to  develop  an  improved  method  for  generating  the 
Fourier  transforms  of  the  input  templates.  This  was  accomplished  by  recording  ;i 
circularly  symmetric  high-pass  filter  (HPF)  on  a  high  resolution  plate,  and  then  pre¬ 
cisely  positioning  the  IIPF  in  the  Fourier  plane  of  a  two-lens,  Fourier  transforming¬ 
imaging  system.  As  a  result  of  successfully  eliminating  the  unwanted,  low  frequency 
components,  system  dynamic  range  was  increased  and  the  “blooming"  effect  pre¬ 
viously  seen  on  the  CCD  camera  was  eliminated.  Also,  for  the  first,  time,  high- 
contrast  black-and-white,  instant  slide  film  was  used  to  record  the  intensity  patterns 
of  the  Fourier  transforms  by  imaging  the  transforms  directly  onto  the  film  plane  of 
a  35mm  camera.  This  process  produced  the  data  needed  for  the  next  portion  of  the 
experiment — coordinate  transforming. 
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To  successfully  perform  the  optical  FLRT  coordinate  transformation  (CT)  of 
the  magnitude-squared  of  the  Fourier  transform  {FT),  a  new  method  of  aligning  and 
focusing  the  FT  onto  the  computer  generated  hologram  (CGH)  had  to  be  developed. 
This  was  accomplished  by  using  a  two-lens  imaging  system  which  allowed  the  FT  to 
be  imaged  directly  onto  the  CGH.  Imaging  onto  the  CGH,  as  opposed  to  the  previous 
method  of  butting  the  FT  against  the  CGH,  vastly  improved  system  alignment  and 
focusing  capabilities.  At  the  same  time,  the  two-lens  system  allowed  the  FT  to 
be  magnified  by  a  factor  of  three  to  overcome  a  space  bandwidth  product  problem 
associated  with  the  resolution  of  the  CGH.  It  was  also  shown  that  the  CGH  does 
behave  in  the  expected  fashion  by  giving  a  logarithmic  shift  along  the  vertical  axis  in 
response  to  scale  changes  of  the  input  images  and  linear  shifts  along  the  horizontal 
axis  in  response  to  rotations  of  the  input.  As  with  the  FT.  the  |CT|*  was  recorded 
using  the  high-contrast  film  and  a  35mm  camera. 

With  FLRT  CTs  in  hand,  a  setup  to  optically  generate  Vander  Lugt  matched- 
filters  had  to  be  designed.  A  unique  system  was  successfully  developed  that  used  a 
thermoplastic  holographic  camera  to  produce  matched-filters  in  approximately  one 
minute.  The  phase-relief  holograms  (matched-filters)  produced  by  the  thermoplastic 
camera  were  then  used  to  produce  successful  optical  correlations  of  the  FLRT  feature 
space  for  the  first  time. 

In  addition  to  developing  an  optical,  FLRT  correlator,  this  thesis  also  veri¬ 
fied  the  optical  results  using  digital  simulations.  CTs  produced  optically  compared 
very  favorably  with  the  digital  results,  showing  straight-line,  vertical  and  horizontal 
shifts  for  scale  and  rotation  changes,  with  the  horizontal  shifts  being  circular  shifts. 
Also,  both  optical  and  digital  CTs  of  vehicles  tended  to  look  very  similar  due  to 
the  rectangular  nature  of  each.  However,  results  showed  that  even  with  the  similar¬ 
ities  in  appearance,  the  optical  correlator  could  still  differentiate  between  the  FLR'l 
feature  spaces.  Also,  with  the  FLRT  feature  space,  correlations  produced  optically 
exhibited  predictable  movements  in  response  to  scale  and  rotation  changes,  and  auto- 
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correlations  versus  crosscorrelations  could  be  visually  discerned  in  most  cases.  Thus, 
this  research  has  shown  that  portions  of  the  Kobel-Martin-Horev  (KMH)  algorithm 
can  be  implemented  optically  and  are  candidates  for  implementation  in  a  real-time 
system. 

5.3  Recommendations 

This  research  has  shown  some  promising  results  toward  the  goal  of  optically  im¬ 
plementing  a  real-time,  PSRI-based,  pattern  recognition  system.  It  is  recommended 
that  research  in  this  area  continue,  and  to  that  end.  the  following  list  of  specific 
topics  is  provided. 

1.  An  important  consideration  concerning  optical  correlations  of  FLRT  feature 
spaces  is  that  a  problem  exists  with  trying  to  correlate  the  circular  shift  from 
rotations  in  the  feature  space  using  a  linear  correlation.  Because  of  this,  a 
4ff  versus  2ff  feature  space  is  required  so  exact  autocorrelations  can  be  made 
possible  (see  Figure  49).  Additionally,  going  to  the  Iff  space  will  eliminate  the 
rotation  ambiguity  problem  discussed  earlier.  Different  methods  of  implement¬ 
ing  a  lff-wide  feature  space  should  be  investigated.  Possibilities  include  using 
a  spatial  light  modulator  (SLM)  to  display  two  copies  of  a  2ff-wide  feature 
space  positioned  next  to  one  another,  developing  a  CGH  that  will  automati¬ 
cally  produce  the  4ff-wide  space,  or  imaging  two  copies  of  the  2ff-wide  space 
onto  the  film  plane  of  a  35mm  camera  to  produce  the  wider  feature  space. 

2.  Spatial  light  modulators  should  be  Used  to  implement  a  real-time  system.  This 
can  be  accomplished  by  using  the  SLMs  as  input  sources  for  binarized  Fourier 
transform  and  coordinate  transform  intensity  patterns,  instead  of  using  instant 
slide  film  to  record  those  intensity  patterns.  Two  electronically  addressable 
SLMs  are  available  for  research  work  at  this  time. 

3.  Non-linear,  barium  titanate  or  lithium  niobate  crystals  should  be  investigated 
as  a  possible  mechanism  for  stripping  the  phase  from  a  complex,  input  scene. 
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This  may  be  possible  because  of  the  crystal’s  ability  to  produce  uniform  ampli¬ 
tude  waves.  If  this  can  be  done  successfully,  the  entire  KMH  algorithm  could 
be  implemented  in  near  real-time. 

4.  The  computer  generated  hologram  should  be  further  characterized  with  respect 
to  the  SBVVP  problem.  An  approach  here  might  be  to  produce  several  different 
sized  copies  of  the  same  input  template.  And,  in  addition,  collect  data  for  a 
series  of  rotation  changes — maybe  in  10°  increments.  Also,  the  apparent  non¬ 
linear  distortion  produced  by  different  regions  of  the  CGH  should  be  looked 
into. 

5.  Because  a  dynamic  range  problem  did  not  allow  sufficient  resolution  along  the 
radial  (vertical)  axis  in  the  FLRT  space,  improvement  in  the  dynamic  range 
should  be  investigated.  One  possibility  is  to  use  a  cylindrical  lens  to  magnify 
the  correlation  plane  in  the  radial  direction  only.  This  will  enhance  the  dynamic 
range  by  allowing  shifts  due  to  scale  changes  to  be  more  readily  detected. 

6.  Once  a  good  set  of  4 7r  feature  space  correlations  are  obtained,  some  type  of 
intelligent  peak  detector  (maybe  a  neural  network)  should  be  trained  to  deter¬ 
mine  whether  or  not  the  correlation  peaks  are  sufficiently  developed  to  distin¬ 
guish  crosscorrelation  peaks  from  autocorrelation  peaks.  Also,  the  correlation 
data  obtained  from  this  research  could  be  analyzed  with  respect  to  the  47t  data 
to  determine  the  amount  of  improvement,  if  any,  that  is  obtained. 

7.  Although  this  research  concentrated  on  the  /|n  r  —  fg  coordinate  transform, 
other  prospective  transforms  exist  which  could  be  used  to  produce  feature 
spaces  that  are  invariant  to  shifts,  scale  changes,  and  rotations.  One  of  these' 
is  known  as  the  Hough  transform.  These  researchers  believe  that,  with  the 
methods  used  in  this  thesis,  an  optical  Hough  transform  can  be  easily  generated 
on  the  optics  table.  This  should  be  pursued. 

8.  Another  item  that  this  research  uncovered  but  did  not  investigate  is  the  pos¬ 
sibility  of  using  the  thermoplastic  camera  to  produce  phase-only  fdters.  The 
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thermoplastic  material  produces  a  phase-relief  hologram,  as  opposed  to  an  am¬ 
plitude  hologram.  With  this  in  mind,  it  may  be  possible  to  illuminate  the 
thermoplastic  with  the  interference  pattern  of  a  phase  function  to  record  the 
phase  function  directly  onto  the  film.  If  this  works,  it  might  be  possible  to 
generate  a  coordinate  transform  feature  space  of  complex  inputs  in  real-time. 

9.  Finally,  since  the  feature  space  investigated  does  not  handle  out-of-plane  rota¬ 
tions,  it  is  recommended  that  further  research  should  concentrate  on  applica¬ 
tions  where  these  rotations  are  unlikely  or  have  been  eliminated.  Two  examples 
are  satellite  imagery  and  synthetic  aperture  radar  (SAR)  data. 
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Appendix  A.  DC-Block  and  Image  Template  Preparation 

A.  I  Introduction 

Based  on  the  results  of  Mayo’s  work  [12]  and  an  unsuccessful  attempt  to  use 
Polaroid  slides,  it  became  necessary  to  develop  a  method  for  producing  high  quality, 
high  resolution  dc-blocks  (high-pass  filters)  and  binary  image  templates.  The  de¬ 
block  used  by  Mayo  was  a  small,  black,  ink  spot  on  a  clear  glass  slide.  The  ink  spot 
was  not  symmetric  (circular)  and  clearly  not  uniform  in  density.  Thus,  it  did  a  poor 
job  of  uniformly  attenuating  the  low  frequency  components  of  a  Fourier  transform 
and  resulted  in  the  production  of  non-symmetric,  optical  Fourier  transforms. 

Some  of  Mayo’s  image  templates  were  very  rough  cutouts  of  geometric  images 
and  letters.  These  rough  cutouts  had  the  disadvantage  of  causing  unwanted  diffrac¬ 
tions  around  the  edges  of  the  images.  Additionally,  an  initial  attempt  in  this  thesis 
effort  at  using  Polaroid  instant  slide  film  for  recording  template  images  failed  mis¬ 
erably.  The  Polaroid  slides  greatly  limited  transmission  and  suffered  from  internal 
reflections  which  produced  a  multiple-image  effect. 

A. 2  DC- Blocks 

Why  have  dc-blocks  in  the  first  place?  The  answer  is  twofold.  First,  the 
information  we  want  to  detect,  the  outlines  of  targets  in  input  scenes,  produces  high 
frequency  terms  in  the  frequency  (spatial)  domain.  Thus,  if  we  can  eliminate  the 
unwanted,  low  frequency  energy,  we  can  increase  the  relative  strength  of  the  higher 
frequencies.  In  other  words,  a  kind  of  edge  enhancement  of  the  image.  This  edge 
enhancement  makes  it  easier  to  “recognize”  the  target. 

Secondly,  because  the  low  frequency  component  of  a  Fourier  transform  contains 
a  delta  function  (mathematically  speaking)  derived  from  the  average  intensity  of  the 
input  scene,  this  component  is  very  intense.  And,  since  a  CCD  camera  was  being 
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used  to  detect  the  intensity  when  recording  Fourier  transforms,  a  conscious  effort  had 
to  be  made  to  eliminate  an  affect  known  as  “blooming.”  If  the  CCD  array  is  subjected 
to  a  beam  intense  enough  to  saturate  the  array  elements,  electric  charges  collected 
in  the  elements  will  “overflow”  into  surrounding  elements.  Thus,  a  “blooming” 
affect.  To  prevent  this,  an  effort  had  to  be  made  to  ensure  saturation  did  not 
occur.  This  could  have  been  accomplished  by  attenuating  the  entire  input  beam, 
including  the  dc  portion.  However,  this  would  have  severely  limited  the  dynamic 
range  of  our  system,  relative  to  the  high  frequency  components,  since  the  intensity 
of  the  desirable  frequencies  would  have  been  attenuated  to  an  unacceptably  low  level. 
Therefore,  blocking  the  dc  component  allowed  the  amount  of  attenuation  required 
to  prevent  “blooming”  to  be  reduced  while,  at  the  same  time,  enhanced  the  dynamic 
range  of  our  system. 

A.  3  Preparation 

To  solve  the  problem  of  preparing  high  quality  dc-blocks  and  binary  imago 
templates,  this  thesis  effort  used  Kodak,  high  resolution,  glass  plates  (2”x2”  Typo 
1A  ultra-flat  photographic  plates)  .  The  plates  used  are  the  same  type  previously 
used  by  Mayo  to  reduce  and  record  the  coordinate  transforming,  computer  generated 
hologram  (see  Appendix  B).  These  plates  are  available  through  GSA  Contract  C1S- 
00F-77352,  modification  61,  dated  February  28,  1987.  Facilities  for  exposing  and 
developing  the  glass  plates  were  available  in  the  Cooperative  Electronics  Materials 
Process  Lab,  room  1065  of  building  125.  Figure  58  shows  the  optical  setup  of  the 
Dekacon  camera  used  to  reduce  and  record  the  de  blocks  and  binary  templates. 

A. 3.1  The  Dekacon  The  Dekacon  camera  system  consists  of  four  main  sub¬ 
sections  (see  Figure  58.  First,  the  object  screen  is  a  four  foot  by  four  foot  apparatus 
which  contains  a  green  light  source  to  provide  the  illumination  required  for  film 
exposure.  It  is  also  a  rigid  support  structure  used  to  hold  the  transparency  being 
photoreduced. 
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Second,  the  camera  support  table  is  a  heavy,  rigid  steel  device  which  has  been 
calibrated  to  allow  precise  positioning  of  the  front  and  rear  boxes.  The  rigidity 
and  weight  of  the  table  also  provide  considerable  stability  for  eliminating  unwanted 
vibrations  (noise)  which  could  affect  the  quality  and  resolution  of  the  photoreduction. 

Next,  the  front  box  is  the  mounting  mechanism  for  any  one  of  several  available 
lenses.  A  35mm  lens  is  available,  as  are  a  12-inch  Wray,  a  5-inch  Wray  ,  and  a  3-inch 
Wray.  The  3-inch  and  5-inch  Wrays  were  used  in  this  thesis  effort  for  preparing  lOx 
and  20x  reductions  of  image  templates. 

The  last  subsection  is  the  rear  box  which  provides  the  mounting  mechanism 
for  the  high  resolution  film  plates.  The  rear  box,  in  conjunction  with  the  front  box. 
can  be  moved  along  the  calibrated  table  to  position  it  for  the  desired  photoreductiou 
power. 

A. 3. 2  The  Procedure  The  first  step  is  to  position  the  front  and  rear  boxes  ac¬ 
cording  to  the  amount  of  reduction  desired.  A  manual  available  in  room  10G5  details 
the  required  box  settings  for  the  various  reductions  for  the  different  lenses.  Once  the 
boxes  are  in  place,  the  Dekacon  has  to  be  focused  for  the  particular  reduction  desired 
before  any  photoreduction  can  be  accomplished.  This  is  done  by  first  mounting  a 
scratched  glass  plate  in  the  film  plane  with  the  scratches  facing  toward  the  object 
screen.  Now,  with  the  object  to  be  focused  mounted  on  the  object  screen,  a  micro¬ 
scope  objective  is  used  to  bring  both  the  object  and  the  scratches  simultaneously 
into  focus.  At  this  point,  the  camera  is  ready  to  do  a  photoreduction. 

With  the  template  in  position  on  the  object  screen  and  the  camera  focused  for 
the  desired  reduction,  an  unexposed  high  resolution  plate  is  inserted  into  the  film 
holder  portion  of  the  rear  box.  This  must  be  accomplished  with  all  lights,  except 
red,  extinguished.  Once  the  unexposed  plate  is  in  position,  the  green  light  of  the 
object  screen  is  illuminated  and  the  plate  is  exposed.  This  research  has  found  that 
an  exposure  time  of  four  to  five  minutes  provides  enough  photons  to  successfully 
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expose  the  film  plate. 


After  exposure,  the  plate  is  removed  from  the  rear  box  (again  with  the  lights 
off)  and  taken  to  a  dark  room  for  development.  In  this  research,  the  following  times 
were  used:  development  time — 2  minutes,  stop  time — 20  seconds,  and  fixer  time — 1 
minute.  Depending  upon  the  age  of  the  film  plates  and  the  age  of  the  developer,  the 
exposure  time  and  development  time  may  need  to  be  varied  until  the  best  quality  is 
obtained.  One  method  of  testing  the  developer  is  to  put  an  unexposed  plate  in  the 
developer  and  determine  the  time  required  for  the  plate  to  start  to  become  cloudy. 
That  time  period  should  then  be  used  for  the  development  time. 

.4.3.3  Templates  The  template  used  for  making  the  dc  blocks  was  a  common 
flat  washer.  The  washer  was  painted  black  in  an  effort  to  reduce  any  stray  reflect  ions 
and  then  mounted  onto  the  object  screen.  By  changing  lenses  and  reductions  powers, 
several  different  sized  black  dots  were  produced.  These  dots  would  then  be  used  as 
the  dc  blocks. 

To  produce  the  image  templates,  red  Rubylith  was  manually  cut  into  the  de¬ 
sired  shapes.  These  Rubylith  cutouts  were  then  mounted  on  the  object  screen  and 
photoreduccd.  The  reason  Rubylith  was  used  is  that  it  blocks  the  green  light  from 
the  object  screen  while,  at  the  same  time,  was  easy  to  cut  into  the  desired  shapes. 
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Object 

Screen 


Figure  58.  Dekacon  Camera  Setup,  Rm  1065  of  Bldg  125.  Note:  Used  to  photore- 
duce  image  templates,  dc  blocks,  and  CGHs. 
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Appendix  B.  The  Computer  Generated  Hologram 


This  appendix  details  the  methodology  used  for  producing  a  computer  gener¬ 
ated  interferogram.  The  interferogram  of  interest  is  one  that  performs  an  optical 
In  r  —  9  coordinate  transformation.  Much  of  the  following  discussion  is  adapted 
from  the  work  of  Mayo  [12]. 

B.l  Generation 

The  computer  generated  hologram  (CGH)  is  a  binary  fringe  pattern  which  lias 
been  produced  (printed)  by  a  laser  printer  and  transferred,  via  photorcduction,  to 
a  high  resolution  glass  plate.  When  illuminated  with  coherent  light,  the  CGH  will 
diffract  the  light  in  such  a  way  that  many  diffracted  orders  are  produced.  To  ensure 
that  the  first  diffracted  order,  which  contains  the  coordinate  transform,  is  separated 
from  the  higher-order  diffractions,  a  sufficiently  high  carrier  frequency  must  be  used. 
The  following  equation  must  be  satisfied  for  the  carrier  frequency  a  to  ensure  no 
overlap  of  diffracted  orders  occurs 


Where  xmax  and  ymax  are  input  dimensions,  A  is  the  wavelength  of  light  used, 
and  fi  is  the  focal  length  of  the  transforming  lens  used  in  conjunction  with  the  CG1I. 

For  this  particular  work,  the  final  dimensions  of  the  CGH  were  to  be  xm,u.  = 
Vmax  =  10mm,  the  wavelength  of  light  was  A  =  632. 8nm  (Helium-Neon  laser  source), 
and  the  focal  length  was  Ji  =  300mm.  With  these  parameter  values,  the  value  for 
q  necessary  to  solve  the  equation  is  30.9  lines  per  millimeter.  An  a  of  35  lines  per 
millimeter  was  chosen  to  meet  the  requirement. 

To  create  the  CGH,  the  binary  fringes  of  the  transmittance  function  were 
printed  on  a  laser  printer  and  then  optically  reduced  using  the  technique  discussed  in 
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Appendix  A.  For  the  In  r  —  9  coordinate  transform  CGH,  the  transmittance  function 
obtained  by  interfering  the  required  phase  function  given  by 


4>(x,y)  = 


2  TT 


A/l 


[ln(x2  -f  y2 )*  ~  yarctan(y/x)  -  x 


(24) 


with  a  carrier,  a,  is  given  by 


t(x,  y)  =  0.5  [1  +  cos(27rax  -  <j>(x,  y))] 


(25) 


The  transmittance  function  then  has  fringes  (maxima)  where  the  argument  of 
the  cosine  satisfies 

27TQX  —  4>{x,  y)  =  lirrik  (20) 

for  integer  values  of  n*.,  where  each  value  of  k  corresponds  to  a  different  fringe. 

Equation  26  is  then  used  to  generate  a  128  by  128  array  of  numbers  of  the 
rnaximurns  at  evenly  spaced  intervals.  The  Metalib  program  on  the  AFIT/ENP  Sun 
workstation  is  used  to  draw  the  contour  plot  based  on  these  numbers.  This  plot  is 
the  CGH.  An  example  is  shown  in  Figure  59. 


B.2  The  Metalib  Program 

As  previously  mentioned,  a  program  called  Metalib  was  used  to  plot  the  CGI I. 
In  particular,  a  routine  of  Metalib  called  RCONTR  was  used.  This  routine  is  shown 
below  [5:49-55]. 
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*uro  50.  Laser  printer  plot  of  a  C'GH  with  A  =  632. 8nm  and  //„  =  300mm.  (Note: 
photorcduct  ion  hv  20  to  1  will  give-  an  a  >  35  lines/mm). 


c 

c  *********  ln  r. theta  interferqgram  ************** 
c 

real  SURF AC( 128, 128) ,  X,  Y,  ALPHA,  LAMDA,  FOCAL,  TEMP,  PI, 

+  INCX,  INCY,  CONTV (2) ,  KS,  Xcoord(128),  Ycoord(128) 
integer  ICTRS 
C 

ICTRS  =  0 

KS  =  -400.0  'not  needed — returns  number  of  lines  plotted 
CONTV(l)  =  -350.0  'set  to  minimum  value  in  array 
C0NTV(2)  =  .6  'delta  value 

C 

X  =  -2.51  'minimum  dimension  of  CGH 

Y  =  -2.51  'minimum  dimension  of  CGH 

ALPHA  =45.0  'Angle  of  plane  reference 
FOCAL  =  300  'focal  length  of  associated  lens 
LAMDA  =  0.0006328  'wavelength 
INCX  =5.0  /  128.0  'step  increment  of  x 
INCY  =  5.0  /  128.0  'step  increment  of  y 
PI  =  3.1415926 
C 

do  10  j  =  1,  128,  1  'this  loop  generates  data 

do  20  i  =  1,  128,  1 

if  (  (X.lt.0)  .and.  (Y.gt.0)  )  then 
TEMP  =  X*0 . 5*alog(X*X+Y*Y) -Y* (PI+atan(Y/X) ) -X 
else  if  (  (X.lt.0)  .and.  (Y.lt.0)  )  then 
TEMP  =  X*0 . 5*alog(X*X+Y*Y)+Y* (PI-atan(Y/X) )-X 
else 

TEMP  =  X*0 . 5*alog(X*X+Y*Y)-Y*atan(Y/X)-X 
end  if 

SURFAC(i , j)  =  ALPHA*X  -  (1/(LAMDA*F0CAL) )*TEMP 
Xcoord(i)  =  X 
X  =  X  +  INCX 
20  continue 
Ycoord(j)  =  Y 
X  =  -2.51 
Y  =  Y  +  INCY 
10  continue 
C 

C  ****  METALIB  Subroutines  ***  (described  in  manual) 

C 

call  setplot(  "lnplotl28",  7,  0,  .false.) 
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c 

call  mapxy(  -2.51,  2.49,  -2.51,  2.49,  0.0,  0.7,  0.0,  0.9,  0,  1  ) 

C 

call  rcontrC  ICTRS,  CONTV,  KS,  SURFAC,  128,  Xcoord,  1,  128,  1, 
+  Ycoord,  1,  128,  1,  .false.  ) 

C 

call  endplot 
C 

end 

The  function  of  the  METALIB  subroutines  SETPLOT,  MAPXY  and  END- 
PLOT  are  explicitly  explained  in  the  METALIB  manual.  In  the  RCONTR  sub¬ 
routine,  the  role  of  ICTRS,  CONTV(l),  CONTV(2),  and  KS  can  be  confusing. 
With  ICTRS  =  0  all  contours  of  the  form  CONTV(l)  +  m  *  CO.V7T'(2).  m  deter¬ 
mined  by  METALIB,  are  drawn.  METALIB  assumes  CONTV(l)  contains  a  starl¬ 
ing  value  and  CONTV(2)  a  delta  value.  It  is  computationally  economical  to  set 
CONTV(l)  =  min  SU RF AC(X ,Y),  however  is  may  be  set  to  any  value  below 
min  SU RF AC(X,  Y)  but  not  above.  CONTV(2)  controlls  the  density  of  the  plot. 
METALIB  increments  on  CONTV(2)  to  draw  contours  by  interpolating  between 
values  given  in  SURFAC(X,Y).  CONTV(2)  must  be  adjusted  to  fit  the  specific  con¬ 
ditions  of  wavelength,  alpha,  focal  length,  and  plot  dimensions.  C'ONTV(2)  =  .fi 
gave  the  best  results  for  the  parameters  used. 
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Appendix  C.  Sight- Mod  Spatial  Light  Modulator 


During  this  thesis  effort,  AFIT’s  Electrical  Engineering  Optics  Laboratory  ac¬ 
quired  two  128  by  128  pixel  Semetex  Sight-Mod  binary  spatial  light  modulators 
(SLM).  These  devices  are  very  similar  to  the  Litton  SLM  investigated  by  Mayo  [12]. 
Both  are  electronically  addressable,  magneto-optical  devices  which  modulate  infor¬ 
mation  on  a  linearly-polarized  light  beam  by  individually  rotating  the  polarization  of 
light  incident  on  each  individual  pixel.  The  polarization  is  rotated  either  clockwise 
or  counterclockwise  depending  on  how  the  pixel  was  addressed.  A  polarizer  at  tin* 
output  is  positioned  to  pass  one  polarization  state  and  block  the  other. 

The  main  difference  between  the  SLM  investigated  by  Mayo  and  the  Sight  - 
Mod  is  the  computer  interface’s  addressing  scheme  used  to  electronically  address  tin* 
128  by  128  array.  Mayo’s  Litton  device  was  originally  designed  to  interface  with  an 
Apple  II  computer.  Mai'o  converted  the  interface  and  software  so  the  Litton  SLM 
could  be  addressed  with  a  Zenith  248  (IBM  compatible)  computer.  The  Litton  SLM 
was  designed  to  address  only  one  pixel  at  a  time.  The  Sight- Mod  was  designed 
to  interface  with  an  IBM  AT  (or  compatible)  computer  addressing  8  pixels  at  a 
time.  Consequently,  the  software  developed  by  Mayo  to  drive  the  Litton  device  is 
incompatible  with  the  Sight- Mod. 

I  he  Sight-Mod  has  provided  source  code  (written  in  the  C  and  Basic  program¬ 
ing  languages)  for  demonstration  routines  for  testing  the  device.  Eight  pixels  are 
addressed  at  a  time  by  writing  a  byte  (8  bits)  to  an  assigned  segment  in  the  com¬ 
puter’s  memory.  The  address  segment  is  selected  using  the  dip  switches  located  on 
the  Sight-Mod  computer  interface  board.  Details  on  changing  the  segment  address 
and  the  memory  addressing  scheme  can  be  found  in  the  “Sight-Mod  Development 
System  Operations  Manual”  [17].  DOCK)  was  the  address  segment  used  during  this 
thesis  research.  For  the  128  by  128  array,  an  image  is  displayed  by  individually 
addressing  memory  locations  D000-D7FF.  Each  memory  location  addresses  8  con- 
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secutive  pixels  in  a  column  of  the  array.  With  this  format,  a  column  of  the  SLM  ( 128 
pixels)  can  be  written  to  by  addressing  only  16  bytes  of  memory.  Semetex  calls  the 
eight  rows  addressed  by  one  byte  a  “byte  row”.  For  example,  if  an  array  has  a  pixel 
location  (r,c)  where  r  is  rows  0-127  and  c  is  columns  0-127,  memory  location  D000 
would  correspond  to  eight  pixels  (0,0)-(7,0)  in  column  0.  The  binary  state  of  each 
bit  of  the  eight-bit  word  written  to  a  memory  location  determines  the  on  or  off  state 
of  the  pixels  addressed.  For  a  byte  written  to  D000,  a  one  in  the  least  significant  bit 
(bit  0)  will  nucleate  the  (0,0)  pixel,  and  a  one  in  the  next  bit  will  nucleate  pixel  ( 1 ,0). 
Nucleation  is  an  intermediate  state  of  a  pixel  that  is  being  set.  A  pixel  is  completely- 
set  after  the  magnetic  coil  surrounding  the  array  is  excited.  This  is  accomplished 
by  addressing  memory  location  D801  after  all  pixels  have  been  addressed,  thereby- 
completing  the  writing  process.  Note  that  reading  the  most  significant  bit  of  memory- 
location  D800  will  indicate  if  the  SLM  is  busy  (it  will  be  a  1)  or  ready  (it  will  be  a 
0).  In  addition,  the  entire  array  can  be  erased  by  addressing  D800. 

The  SLM  has  considerable  potential  as  a  tool  for  demonstrating  optical  archi¬ 
tecture  proposed  for  real-time  pattern  recognition.  The  applications  range  from  a 
incoherent-to-coherent  image  converter  to  removal  of  the  phase  from  a  complex  wave- 
front.  For  this  research,  the  SLM  was  used  to  modulate  intensity  distributions  of 
complex  wavefronts  detected  and  stored  with  a  CCD  camera  and  the  AT.CT  TARCA 
framegrabber.  The  computer  source  code  used  to  redisplay  TARCA  images  is  listed 
below.  The  program  was  written  and  compiled  using  Turbo  C’.  This  program  takes  a 
512  by  400  pixel.  256  grey  level,  TARCA  format  image  and  reduces  it  to  128  by  100 
pixels  by  averaging  and  binarizes  it  by  thresholding.  The  scaled,  binarized  image  is 
then  written  to  the  SLM. 


110 


/************************************************************** / 


/*  SLM_DISPLAY_TARGA  */ 
/*  */ 
/*  Written  by  J.  Thomas  Walrond  and  Timothy  Childress  */ 
/*  Program  displays  a  TARGA  image  file  on  the  Sight-Mod  */ 
/*  Spatial  Light  Modulator.  First  the  image  is  reduced  */ 
/*  from  512  by  400  pixels  to  128  by  100  pixels  by  */ 
/*  averaging.  Then  the  image  is  binarized  by  thresholding.*/ 
/*  Then  the  image  is  reformatted  and  written  to  the  Sight-*/ 
/*  Mod.  */ 


/**************************************************************/ 

#include  "stdio.h" 

#include  "dos.h" 

^include  "math.h" 


int  screen=0xb000 ,  value=0xf0,  val ; 

unsigned  temp  [4]  [512],  avgtemp  [128]  [128],  tempi; 

int  i , j , j j , j i ,x,y .r.avgi ,1 ,favg,  bt.nr; 

char  ctemp,  cval.slm  [16]  [128]; 

float  avg, sum, f temp,  jf,  thrshld=50; 

int  c.busy; 

FILE  *frl; 


main() 

{  pokeb(screen,  0x800,  Oxf f ) ; /*clear  slm*/ 

for(busy=0x80;  busy==  0x80;  ) 
busy  =  0x80  k  peekb(screen, 0x800) ; 

/♦Retrieve  a  a  Targa  CCD  file  (512x400),  reduce  to  128x100  by  blurring.*/ 


f rl=fopen("I 13_f c .tga" , "rb") ; /*get  ccd  file*/ 
for(r=0;  r<100;r++)/*retrive  data  sequentially  into  4x512array*/ 

{ 

for(x=0;  x<4;  x++) 

{ 

for(y=0;  y<5 12 ;  y++) 

{ 


templ=ctemp ; 


ctemp=getc(f rl) ; 

if ( t emp 1 >255 )templ=temp 1-65280 ; 
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temp  [x]  [y]=  tempi; 

/♦printf ("Xu  \n" ,  temp  [x]  Cy] ) ; 

printf ("Xu" , x) ; 

printf ("Xu  \n",y); 

printf ("Xu  \n",  ctemp  [x]  Cy]);*/ 

> 

> 

for(j=0;  3 <512 ;  j+=4){/*reduce  4x512  array  to  1x128  by  avg*/ 

sum=0 ; 

f°r(jj=j;  jj<j+4;  jj++){ 
for(i=0;  i<4;  i++){ 

ftemp=temp  [i]  [j j]  ; 
sum=sum  +  ftemp; 

> 

> 

avg=ceil((sum/l6)-0.5) ; 
avgi=avg; 

jf=j ; 

jf=jf/4; 

ji=Jf ; 

avgtemp  [r]  [ji]  =avgi ; 

/♦printf ("Xu", avgi) ;  */ 

> 

> 

fclese(frl) ; 

/♦binarize  pixels  based  on  threshold*/ 
for(x=0;  x<128;  x++){ 
for(y=0;  y<128;  y++){ 

/♦printf ("Xu  \n",  avgtemp  [x]  [y]);*/ 

if (avgtemp  [x]  [y]  <  thrshld) avgtemp  [x]  [y]  =  1; 

else  avgtemp  [x]  [y]  =0; 

/♦printf ("Xu  \n",  avgtemp  [x]  [y]);*/ 

> 

/♦printf ("Xu  \n",  avgtemp  [x]  [y]); 
printf ("Xu",  x) ; 
printf ("Xu",  y) ; */ 

} 

/♦convert  8  pixels  into  a  eight  bit  character  for  addressing  the  SLM*/ 
for(c=0;  c<128;c++){ 
nr=0 ; 

for(r®0;  r<96;  r+=8){ 
bt=r ; 
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if(avgtemp  [bt]  [c]  *=  l)val*val+l; 
bt++; 

if(avgtemp  [bt]  [c]  ==  l)val=val+2; 
bt++; 

if(avgtemp  [bt]  [c]  ■*  l)val=val+4; 
bt++; 

if(avgtemp  [bt]  [c]  *=  l)val=val+8; 
bt++; 

if(avgtemp  [bt]  [c]  *=  l)val=val+16 ; 
bt++; 

if(avgtemp  [bt]  [c]  *=  l)val=val+32 ; 
bt++ ; 

if(avgtemp  [bt]  [c]  ==  l)val*val+64 ; 
bt++ ; 

if(avgtemp  [bt]  [c]  ==  l)val=val+128; 
bt++ ; 

/♦printf ("Xu  \n",  val);*/ 
cval=val ; 

/♦if (cval>128)  templ=templ-65280;^/ 
slm  [nr]  [c]=cval; 

/♦printf  ('"/.u  \n",  slm  [nr]  [c]);*/ 
nr++; 

> 

> 

/*  nr=13; 

for(c=0;  c<128;  c++){ 

for(r=96;  r<128;  r+=8){ 
val=0 ; 

cval=val ; 

slm  [r]  [c]  *  cval; 

> 

>  */ 

r=0;  /*  fclose(frl) ;  ♦  / 

for(x=0;  x<16;  x++){ 
f or (y=0 ;  y<128;  y++){ 

/♦printf  ("*/.u  \n",  slm  [x]  [y]  ) ;  */ 

pokeb(screen,  r,slm  [x]  [y] ) ; 


> 

> 

pokeb ( screen , 0x80 1 , Oxf f ) ; 

> 


It  also  should  be  noted  that  TARGA  has  a  software  package  called  “TARGA 
Tools”  that  could  be  very  useful  in  interfacing  the  CCD  camera  with  the  Sight- 
Mod  in  a  real-time  fashion.  These  programs  have  been  written  in  C  and  compiled 
using  the  Microsoft  C-86  compiler.  TARGA  tools  enables  a  programmer  to  remotely 
grab  an  image  into  the  TARGA  board’s  external  memory  where  it  can  accessed 
for  further  processing,  such  as  displaying  on  the  Sight-Mod.  The  TARGA  board  is 
capable  of  grabbing  an  image  into  memory  in  1  / 30th  of  a  second.  The  Sight-Mod 
is  capable  of  displaying  a  frame  as  fast  as  1  /300th  of  a  second.  Therefore,  the  only 
other  limiting  factors  are  the  memory  access  time  for  reading  the  TARGA  memory 
and  the  computational  time  for  reducing,  binarizing,  and  reformatting  the  data  for 
display  on  the  Sight-Mod. 
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Appendix  D.  CCD  Camera  Image  Format 

D.l  CCD  Images 

To  save  resultant  images  during  this  research,  a  Sony,  charged  coupled  device 
(CCD)  camera  was  used  in  conjunction  with  an  AT&T  Truevision  Raster  Graph¬ 
ics  Adapter  (TARGA  8).  The  TARGA  8  was  installed  in  a  Zenith  248  (IBM  AT 
compatible)  computer.  The  TARGA  8  board  captures  a  full  NTSC  frame  in  1  / 30th 
of  a  second.  The  captured  image  is  digitized  into  a  512  by  400  pixel  array  using  a 
256-level  grey  scale.  The  TARGA  8  then  displays  the  digitized  image  on  a  composite 
monochrome  monitor. 

Using  the  True_art  software  package  provided  with  the  TARGA  8,  images  can 
be  captured  and  saved  as  a  computer  file  for  image  processing  or  redisplay.  Digitized 
images  are  stored  as  8-bit,  unsigned  integer  pixels  in  a  512  by  400  row-major  format 
array.  The  array  is  stored  in  a  sequential  file  where  each  record  is  a  row  of  imagery. 

An  image  file  stored  on  a  MS  DOS  formatted  5  1/4  inch  disk  can  be  transferod 
to  the  VAX  I780A  using  the  binary  file  transfer  protocol  (FTP)  on  the  ETHERNET. 
The  file  transfer  can  be  accomplished  using  the  following  procedure: 

1.  Using  an  IBM  compatible  computer  connected  to  the  ETHERNET,  type  “ftp 
<return>ri  to  enter  the  file  transfer  protocol  program. 

2.  Type  “binary  <return>”  to  indicate  that  a  binary  file  is  being  transferred. 

3.  Type  “open  imv2a  <return>”.  Then  type  “user  <return>”  at  the  prompt  for 
username,  and  type  “<return>’1  at  the  prompt  for  a  password.  The  IMV2A 
is  a  gateway  for  file  transfer  on  the  ETHERNET  to  the  DECnet. 

4.  Type  “put  filename  1  filcnamc2  <return>”  where  filename  1  is  the  name  of  the 
file  you  are  transfering  from  the  the  disk,  and  filename2  is  the  name  the  file 
will  have  in  the  USER  directory  of  the  IMV2A.  Filenamel  and  filenamc2  can 
be  the  same  name  but  do  not  have  to  be. 
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5.  Once  the  file  transfer  is  complete,  the  FTP  can  be  exited  by  typing  either  “bye 
<return>”  or  “quit  <return>.” 

6.  The  file  transfer  can  now  be  completed  by  logging  on  to  the  VAX  I780A  and 
using  the  copy  command  to  copy  the  file  from  the  IMV2A.  Once  the  copy  is 
complete,  delete  “filename2”  from  the  IMV2A  USER  directory. 

CCD  images  were  transfered  to  the  VAX  I780A  for  digital  processing  and  fu¬ 
ture  viewing.  The  two  computer  programs  which  utilized  these  images  the  most 
were  “Executive,”  developed  by  Kobel  and  Martin  to  digitally  implement  the  pat¬ 
tern  recognition  algorithm  proposed  by  Horev  [9],  and  “Hologram.  Menu,”  a  digital 
simulation  of  an  optical  phase-only  filter  designed  to  perform  In  r  —  0  coordinate 
transforms.  The  required  image  data  formats  for  both  programs  were  different  than 
the  CCD  image  format.  Two  programs  were  written  in  ADA  to  convert  the  CCD 
images  into  a  format  useable  by  “Executive”  and  “Hologram.  Menu.”  The  first  pro¬ 
gram  is  called  “CCD_toj256_USB .Format.”  This  program  takes  a  CCD  image  file 
and  reduces  the  image  from  a  512  by  400  array  to  a  256  by  256  unsigned  byte  file. 
This  program  also  has  the  option  to  binarize  the  image.  “Executive”  and  “Hologram. 
Menu”  are  both  designed  to  process  256  by  256  image  arrays. 

The  second  program  is  called  “Format_256_image.”  This  program  gives  the 
user  the  option  of  converting  a  256  by  256  unsigned  byte  image  array  into  the  pseudo 
signed  byte  format  used  in  “Hologram.  Menu”,  or  to  the  complex  number  format  used 
in  “Executive.”  This  program  can  also  convert  image  files  from  “Hologram.  Menu” 
and  “Executive”  back  to  the  unsigned  byte  format.  The  unsigned  byte  format  is  used 
by  the  microVAX  Fortran  display  program  “Display.Images”  which  was  adapted 
from  a  program  of  the  same  name  written  by  Mike  Roggemann. 

The  program  code  for  “CCD_to.256_USB_Format”  and  “Format _256_image” 
and  supporting  software  packages  are  listed  below: 
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Procedure  CCD_T0_256_USB_F0RMAT 


Written  by  J.  Thomas  Walrond  and  Timothy  Childress 


—  Description:  Procedure  takes  an  image  (400  by  512  pixel)  stored  using 

—  the  AT&T  TATGA  frame  grabbing  system  and  reduces  the  image  down  to  a 

—  size  useable  by  the  procedures  Executive  and  Hologram  Menu  (256  by 

—  256  pixels) .  The  procedure  gives  two  options  in  reducing  the  image 

—  size:  sampling  and  cropping.  A  binarizing  option  is  available  for 

—  cleaning  up  binary  templates.  The  binarizing  threshold  value  is  128. 

with  Data.types ;  use  Datatypes; 

with  File_io;  use  File_io; 

with  text_io;  use  text.io; 

with  float_text_io;  use  f loat_text_io ; 

with  inteper_text_io ;  use  integer_text_io ; 

with  IMAGE.T00LS;  use  IMAGE. TOOLS ; 


procedure 

CCD_T0_256_USB_F0RMAT 

nl,  n: 

string(l . .80) ; 

11,  1: 

integer; 

im: 

im_ar(l . .401 , 1 . . 512) 

exim: 

im_ar(l . .512,1. .512) 

fim,  sim: 

im_ar(l . . 256 , 1 . . 256) 

answerl : 

integer; 

answer2 : 

character ; 

begin 

MAIN.L00P : 
loop 

put ("Enter  file  name>  "); 
get_line(n,  1); 
get_image(n(l . .1) ,  1,  im) ; 

put ("CCD  TO  256  BY  256  IMAGE  FORMATING  OPTIONS:"); 
new.line ; 

put("  1.  Expand  ccd  image  to  512  by  512  with  zero  fill"); 


i 
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and  sample  to  256  by  256") ; 


new.line; 
put(" 
new.line; 
new.line; 

put("  2.  Crop  ccd  image  down  to  256  by  256"); 

new.line; 

loop 

put("enter  option  (1  or  2)>"); 
get(answerl) ;  skip.line; 

case  answerl  is 

when  1  =>  EXPAND_CCD.IMAGE(im,  exim) ; 

SAMPLE. IMAGE (EXIH ,  sim) ; 
exit ; 

when  2  =>  CROP_CCD_IMAGE(im,  sim);  exit; 
when  others  =>  null; 
end  case; 
end  loop; 
loop 

put ("Binarize  image  (y  or  n)?>"); 
get(answer2) ;  skip. line; 
case  answer2  is 

when  »y»  |  »Y»  =>  BINARIZE. IMAGE (sim,  f im) ;  exit; 

Put ("binary  complete") ; new.line; 
when  'N*  I  ’n’*>  fim  :=  sim;  exit; 
when  others  =>  null; 
end  case; 
end  loop; 

put("enter  output  file  name>") ; 
get_line(nl,  11);  new.line; 
store_image(f ira,  nl(l..l),  11); 
put("image  is  stored");  new.line; 
loop 

put ("Would  you  like  to  convert  another  ccd  image?  (y/n)>‘ 
get (answer2) ;  skip.line; 
case  answer2  is 

when  'n’  I  »N*  =>  exit  MAIN.LOOP; 
when  'Y’  I  'y'  =>  exit; 

when  others  =>null; 
end  case; 
end  loop; 
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end  loop  MAIN.LOOP; 
end  CCD_to_256_usb_image; 


Procedure  FQRMAT_256_ IMAGE 

—  Written  by  J.  Thomas  Walrond  and  Timothy  Childress 


—  A  main  program  to  convert  256  by  256  images  in  to  the  proper — 

—  data  type  format.  Options  to  convert  from  the  unsigned  byte — 

—  to  the  pseudo-signed  byte  format  used  in  Marvin  Hill’s 

--  Hologram_menu  and  the  complex  format  of  Kobel  and  Martin’s 

—  Executive.  The  program  also  allows  reformatting  back  to  the-- 

—  USB  format . 


WITH  INTEGER.TEXT.IO ; 
with  DATA.TYPES; 

WITH  IMAGE.TOOLS; 

WITH  TEXT, 10; 
with  DECLARATION.HANDLER ; 
with  FILE.IO; 
with  FILE.HANDLER; 
with  DATA.STANDARD; 
with  SSI.IO; 

procedure  F0RMAT.256.IMAGE  is 


USE  INTEGER.TEXT.IO ; 

USE  DATA.TYPES; 

USE  IMAGE.TOOLS; 

USE  TEXT. 10; 

use  DECLARATION.HANDLER ; 
use  FILE.IO; 
use  FILE.HANDLER; 
use  DATA.STANDARD; 
use  SSI.IO; 


n2,  nl 

answerl,  11,12 
answer2 

iml 

max.row, 

im2 


string(l . . 80) ; 

integer; 

character; 

:  im_ar(l . . 256 ,  1..256); 
max.col:  integer; 

: DATA.STANDARD. IMAGE. ARRAY_2D(l. .256,  1.  .256)  ; 


begin 
MAIN.LOOP : 
loop 

— Make  formatting  options  menu 
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new_line; 

put("*******»  256  by  256  IMAGE  FORMATTING  OPTIONS  «*******"); 
new.line;  new_line;  new_line; 

put("  1.  Unsigned  byte  (USB)  to  Hologram_menu  format."); 
new.line; 

put("  2.  USB  to  Executive  (Kobel  and  Martin)  format."); 
new_line; 

put(”  3.  Hologram.menu  to  USB  format."); 
new.line; 

put("  4.  Executive  to  USB  format."); 
new.line;  nev_line; 
put ("ENTER  OPTION  (l-4)>  "); 
get(answerl) ;  skip.line; 

— Get  iput  and  output  filenames 
Put ("ENTER  FILENAME>  "); 
get_line(nl ,11) ; 
new_line; 

loop 

case  answerl  is 

when  1 | 2  =>  get„image(nl(l . . 11) ,  11,  iml);exit; 

when  3  ->  SSI_IQ.read_ssi_image(nl ,11 , im2) ;  exit; 

when  4  =>  FILE_HANDLER.GET_FILE(nl ,  11,  max.row,  max.col) ; exit 

when  others  =>  null; 
end  case; 
end  loop; 

put("Found  file");  new_line; 
put ("ENTER  OUTPUT  FILENAME>  ") ; 
get_line(n2,  12); 
new_line; 

— format  image 
loop 

case  answerl  is 

when  1  =>  USB_T0_H0LQ_F0RMAT(iml ,  n2,  12);exit; 
when  2  =>  USB_TO_EXECUTIVE_FORMAT(iml ,  n2,  12);exit; 
when  3  =>  H0L0_T0_USB_F0RMAT(im2 ,  n2,  12);exit; 

when  4  =>  EXECUTIVE_TO_USB_FORMAT(nl ,  11,  n2 ,  12,  max. col,  max.row) ; 
exit ; 

when  others  =>  null; 
end  case; 
end  loop; 
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Put("Format  another  image?  (y/n)>  "); 
get (answer2) ;  skip_line; 

loop 

case  answer2  is 

when  ’n'  I  'NJ  *>  exit  MAIN_L00P; 
when  'Y'  I  'y'  »>  exit; 
when  others  =>  null; 
end  case; 
end  loop; 

end  loop  MAIN_L00P; 
end  Format _256_ image ; 


PACKAGE  Byte.io 


Declares  a  new  input/output  package  "Byte_io 
which  is  a  instantiation  of  the  sequential.io  package  with 
the  constraint  that  the  data  for  input/output  will  be  of  the 
type  unsigned  byte  in  a  one  dimensional  array — the  data 
used  to  store  the  CCD  camera  images. 

with  sequent ial_io ; 

with  datatypes;  use  datatypes; 

package  byte_io  is  new  sequential.io  (element_type  =>  unsigned_byte_ar_ld) ; 
—  end  Package  Byte_io 


Unsigned_byte_ar_ld  is  a  one  dimensional  array 
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used  to  stored  images,  im.ar  is  a  two  dimensional  array  of 
intergers  used  to  process  images. 


with  system;  use  system; 
package  data.types  is 

type  unsigned_byte_ar_ld  is  array  (integer  range  <>)  of  unsigned_byte ; 
type  im_ar  is  array  (integer  range  <>, integer  range  <>)  of  integer; 

end  data_types; 


Package:  File_io 


Package  contains  two  procedures: 

get_image  and  store_image.  These  procedures  enable 
the  input  and  conversion  of  unsigned  byte  format  images 
into  integer  format  images  for  processing  and  display, 
and  the  storage  of  integer  format  images  in  the  unsigned 
byte  format  (8  bit) . 


with  system;  use  system; 

with  byte.io;  use  byte.io; 

with  data_types;  use  data_types; 

package  file_io  is 


procedure  get_image  (namel:  in  string; 

last:  in  integer; 

image:  out  im_ar  ); 

procedure  store_image(image :  in  im_ar; 

namel:  in  string; 
last:  in  integer); 

end; 
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package  body  file_io  is 


procedure  get_image  (namel:in  string; 

last:  in  integer; 
image:  out  im_ar  )  is 


input:  byte_io.file_type; 

line:  unsigned_byte_ar_ld(image ’ range(2) ) ; 

begin 

open (file=> input , 

mode=>in_f ile, 
name=>namel (1 . . last) ) ; 


for  r  in  image ' range ( 1 )  loop 
read(input,  line); 

for  c  in  image ’ range (2)  loop 

image(r.c)  :=  integer(line(c) ) ; 


end  loop; 
end  loop; 
close(input) ; 
end; 


procedure  store.image (image:  in  ira_ar; 

namel:  in  string; 
last:  in  integer)  is 

output:  byte_io .f ile_type; 

line:  unsigned_byte_ar_ld(image ’ range(2) ) ; 

begin 

create(f ile=>output , 

mode=>out_f ile , 
name=>namel ( 1 . . last) ) ; 

for  r  in  image ' ranged )  loop 
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for  c  in  image ’range (2)  loop 

line(c)  :*  uns igned_byte( image (r, c) ) ; 

end  loop; 

write(output ,  line) ; 
end  loop; 
close(output) ; 
end; 

end  File_io; 


IMAGE.TOOLS 


—  Written  by  J.  Thomas  Walrond  and  Timothy  Childress 


— Description:  Tool  package  created  to  support  the  procedures 
— CCD_to_256_USB_Format  and  Format_256_image .  Individual  description 
— listed  with  each  procedure. 

with  data_types;  use  data_types ; 

with  file_io;  use  file_io; 

with  text_io;  use  text_io; 

with  float _text_io ;  use  f loat_text_io ; 

with  DECLARATION_HANDLER;  use  DECLARATION.HANDLER; 

with  DATA .STANDARD;  use  DATA.STANDARD; 

with  SSI.IO;  use  SSI.IO; 

with  FILE.HANDLER;  use  FILE.HANDLER; 

PACKAGE  IMAGE.TOOLS  IS 

procedure  expand_ccd_image(im  :  in  im.ar; 

eximl:  out  im.ar) ; 

procedure  BINARIZE_IMAGE(im  :  in  ira.ar; 

b_im:  out  im_ar) ; 

procedure  CROP_CCD_IMAGE(im  :  in  im_ar; 

cim:  out  im_ar) ; 


124 


procedure  SAMPLE. IMAGE (im  :  in  im.ar; 

sim:  out  im.ar) ; 


procedure  USB_T0_H0L0_F0RMAT(im 

n 

1 


in  im_ar; 
in  string; 
in  integer) ; 


procedure  USB_TO_EXECUTIVE_FORMAT(im 

n 

1 

procedure  H0L0_T0_USB_F0RMAT(im  : 

n  : 


:  m  im_ar; 

:  in  string; 

:  in  integer) ; 

in  DATA.STANDARD . image_array_2d ; 
in  string; 
in  integer) ; 


procedure  EXECUTIVE_TO_USB_FORMAT(nl 

11 

n2 

12 


Max.col 

Max.row 


in  string; 
in  natural; 
in  string; 
in  natural ; 
in  integer; 
in  integer) ; 


END; 

PACKAGE  BODY  IMAGE.TOOLS  IS 


Procedure  EXPAND_CCD_ IMAGE 


—  Takes  ccd  image  (512  by  401  pixels)  saved  on  the  TARGA  system  and 
expands  the  image  to  512  by  512  by  adding  zeros  evenly  above  and 
--  below  the  image 

procedure  expand_ccd_image(im:  in  im_ar; 

eximl:  out  im_ar)  is 

exim:  im_ar(l . .512,1 . .512)  :=  (others=> (others=>0) ) ; 


begin 
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for  r  in  im’range(l)  loop 
for  c  in  im' range (2)  loop 

exim(r+55,c)  :=  im(r,c); 

end  loop; 
end  loop; 
eximl  :=  exim; 
end; 


Procedure  BINARIZE.IMAGE 


—  Thresholds  images  at  pixel  intensity  value  -  125  (intensity  range  1-256) . 

—  If  <  125  set  pixel  intensity  to  0.  If  not,  then  set  value  to  250. 

procedure  BINARIZE_IMAGE(im:  in  im_ar; 

b_im:  out  im_ar)  is 

begin 

for  r  in  im’range(l)  loop 
for  c  in  im’ range (2)  loop 
if  im(r,c)  <  125  then 
b_im(r,c)  0; 
else 

b_im(r,c)  :=  250; 
end  if; 
end  loop; 
end  loop; 
end; 


Procedure  CR0P_CCD_ IMAGE 


Grabs  the  center  256  by  256  section  of  an  image  grabbed  by  a  TARGA 
—  board  using  sony  ccd  camera. 

procedure  CROP_CCD_IMAGE(im  :  in  im_ar; 

cim:  out  ira_ar)  is 
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begin 

for  r  in  73.. 328  loop 
for  c  in  129.. 384  loop 

cim(r-72,  c-128)  :=  im(r,  c) ; 
end  loop; 
end  loop; 
end; 


Procedure  SAMPLE_512_T0_256_IMAGE 


— Samples  image  by  every  other  row  and  every  other  column  to  reduce  image 
— from  512  by  512  to  256  by  256. 

procedure  SAMPLE_ IMAGE (im  :  in  im_ar; 

sim:  out  im_ar)  is 


begin 

FOR  R  IN  1. .256  LOOP 
FOR  C  IN  1. .256  LOOP 

SIM(R.C)  :=  IM(2*Rf 2*C) ; 
END  LOOP; 

END  LOOP; 
end; 


Procedure  USB _T0_H0L0_ FORMAT 


—  CONVERTS  unsigned  byte  format  images  (already  in  integer  form)  to 

—  the  pseudo-signed  byte  format  used  in  the  Hologram.menu  program 

—  written  by  Marvin  Hill 


procedure  USB_T0_H0L0_F0RMAT ( im 

n 

1 


in  im_ar; 
in  string; 
in  integer)  is 
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im2:  DATA_STANDARD.image_array_2d(im'range(l) ,  im'range(2) ) ; 
begin 

im2:  =  DATA.STANDARD. image_array_2d(im) ; 

SAVE.SSI. IMAGE (n,  1,  im2) ; 
end; 


Procedure  USB.TO.EXECUTIVE.FORMAT 

Converts  images  in  the  unsigned  byte  format  to  the  complex  format  used — 
by  Kobel  and  Martin's  program  Executive. 

procedure  USB_TO_EXECUTIVE_FQRMAT(im  :  in  im_ar; 

n  :  in  string; 

1  :  in  integer)  is 

comp.im:  DECLARATION.HANDLER. image_array_type(im ' range( 1) ,  im’range(2)) ; 
begin 

for  r  in  im'range(l)  loop 
for  c  in  im’ range (2)  loop 
comp_im(r,c) . imag  :*  0.0; 
end  loop; 
end  loop; 

for  r  in  im' range (1)  loop 
for  c  in  im'range(2)  loop 

comp_im(r ,c) .real  :=  float(im(r,c)) ; 
end  loop; 
end  loop; 

FILE_HANDLER.save_f ile(n,  1,  comp_im,  "vid") ; 
end  ; 


Procedure  H0L0_T0_USB_F0RMAT 
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Converts  image  in  the  HOLOGRAM.MENU  image  format  (written  by  Marvin  Hill) 
to  the  unsigned  byte  format  so  the  images  can  be  displayed  on  the  micro  VAX 
using  Mike  Roggemann’s  DISPLAY.IMAGE  FORTRAN  routine. 

procedure  H0L0_T0_USB_F0RMAT(im  :  in  DATA.STANDARD. image_array_2d; 

n  :  in  string; 

1  :  in  integer)  is 

usb.im:  im_ar(im' range (1) ,  im'range(2) ) ; 

begin 

for  r  in  im' range (1)  loop 
for  c  in  im'range(2)  loop 
usb_im(r,c)  :=  im(r,c); 
end  loop; 
end  loop; 

STORE. IMAGE (usb.im,  n(l..l),  1); 
end; 

Procedure  EXECUTIVE.TO.USB.FORMAT 

Convert  image  files  created  with  Kobel  and  Martin's  Executive  to  the  — 
unsigned  byte  format  for  display  using  Mike  Roggemann's  Display. image — 

procedure  EXECUTIVE_TO_USB_FORMAT(nl  :  in  string; 

11  :  in  natural; 

n2  :  in  string; 

12  :  in  natural ; 

Max.col  :  in  integer; 

Max.row  :  in  integer)  is 

comp.im:  image_array_type(l. .max.row,  1.. max.col); 

im:  im_ar(l . .max.col ,  1.. max.col); 
max.val:  float; 
value:  float; 

begin 
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FILE.HANDLER.read.f ile(nl ,  11,  comp.im) ; 


— find  maximum  real  value 
for  r  in  comp.im’ range (1)  loop 
for  c  in  comp.im' range (2)  loop 

if  max.val  <*  comp_im(r,c) .real  then 
max.val  :=  comp_im(r,c) .real; 
end  if; 
end  loop; 
end  loop; 

—  normalize  data 
for  r  in  comp.im’ range (1)  loop 
for  c  in  comp_im' range (2)  loop 

value  :=  (comp_im(r,c) .real/max_val) *256.0; 
im(r,c)  :=  integer (value) ; 
end  loop; 
end  loop; 

STORE. IMAGE (im,  n2(l..l2),  12); 
end; 

END  Image.Tools; 


130 


Appendix  E.  Code  to  Generate  the  4>  Function 


This  section  presents  an  excerpt  from  the  source  code  of  HologramJVlenu  writ¬ 
ten  by  Marvin  Hill.  This  portion  of  the  code  was  used  to  generate  the  phi,  (  $  ), 
function  (described  in  Section  2.4)  used  to  digitally  model  the  optical  In  r  —  0  , 
phase-only  filter.  The  program  adds  the  $  term  to  a  256  by  256  pixel,  real-valued 
image.  Then  Hologram_Menu  Fourier  transforms  the  resultant  complex  image  using 
a  Fast  Fourier  transform  (FFT)  routine  written  by  Kobel  and  Martin. 

The  actual  $  function  is  dependent  upon  the  wavelength  of  the  light  source, 
the  focal  length  of  the  Fourier  transforming  lens,  and  the  position  of  an  image’s  pixels 
with  respect  to  the  image’s  center.  These  scaling  factors  do  not  exist  for  a  FFT. 
Therefore,  the  $  function  was  implemented  with  a  general  multiplicative  scaling 
factor  called  “scale”.  Scale  was  adjusted  for  the  most  accurate  looking  coordinate 
transform  of  a  100  by  100  pixel  square  template.  A  scaling  factor  of  9.0  was  used  to 
generate  the  digital  coordinate  transforms  for  this  thesis. 


procedure  ADD_LNR_THETA_PHASE(  Work:  in  out 
DATA.STANDARD.  Complex.  Array  „.R2d  )  is 

begin  --  ADD_LNR_THETA_PHASE 

—  Multiply  by  Ln  r  -  Theta  phase  and  take  Fourier  Transform 
new.line ; 

XY_P0S(1 .Remarks.line) ; 

put("  '/.REM  -  Multiplying  by  Ln  r  -  Theta  phase  function..."); 
ERASE.LINE.FROM. CURSOR;  new.line; 

for  X  in  Work'range(l) 
loop 
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for  Y  in  Work’range(2) 
loop 


XI  :=  float (  X  -  X_Center  ); 

Y1  :*  float (  257  -  Y  -  Y.Center  ); 

if  XI  /=  0.0  and  Y1  /*  0.0  then 

Phi  :*  2.0  *  pi  *  (  XI  *  (  logC  sqrt(  Xl**2  +  Yl**2  ) 

)-  1.0  )-  Y1  *  atan2(Yl,Xl)  )  /SCALE; 
else 

Phi  :=  0.0; 
end  if; 

In.Real  :a  Work(X,Y) .Real_N; 

In.Img  :a  Work(X.Y) .Img_N; 

Work(X,Y) .Real.N  :=  In.Real  *  cos(Phi)  -  In.Img  *  sin(Phi) ; 
Work(X.Y) .Img_N  :=  In.Real  *  sin(Phi)  +  In.Img  *  cos(Phi); 


end  loop; 
end  loop; 


end  ADD_LNR_THETA_PHASE ; 
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